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Abstract 

Adaptive Optics is a prime example of how progress in observational astronomy can be 
driven by technological developments. At many observatories it is now considered to be 
part of a standard instrumentation suite, enabling ground-based telescopes to reach the 
diffraction limit and thus providing spatial resolution superior to that achievable from space 
with current or planned satellites. In this review we consider adaptive optics from the as- 
trophysical perspective. We show that adaptive optics has led to important advances in our 
understanding of a multitude of astrophysical processes, and describe how the requirements 
from science applications are now driving the development of the next generation of novel 
adaptive optics techniques. 



1 Introduction 

The first successful on-sky test of an astronomical adaptive optics (AO) system was re- 
ported with the wo rds "An old dream of ground-based astronomers has finally come true" 
(jMerkle et al. 19891 ). This jubilant mood resulted from successfully reaching the near- 



infrared diffraction limit of a 1.5-m telescope. Since then, both the technology and ex- 
pectations of AO systems have advanced considerably. The current state of the art is 
shown in Fig. [1] Here, the adaptive secondary AO system of the Large Binocular Tele- 
scope (LBT), which has 8.4-m primary mirror s, recorded a phenomenal 85% Strehl ratio 
in the H-band (1.65 /im) ( Esposito et al. 2Q10h . In parallel to improving the performance 



at near-infrared wavelengths, there is an effor t to reach the diff raction limit in the opti- 



cal. Using AO with on-the-fly image selection, lLaw et al. (2009al ) achieved a resolution of 
35 milliarcsec (mas) , the 700 nm diffraction limit of the 5-m telescope at Palomar Observa- 
tory. This is close to the highest resolution direct optical image, which had a FWHM of 
22 mas and was achieved at 850 nm on the 10.2-m Keck II telescope in good atmospheric 
conditions ( Wizinowich et al. 2000h . 
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Figure 1: 

Images from commissioning of the LBT adaptive secondary AO system that demonstrate the 
capabilities of modern adaptive optics. Left: the double star 53 Bootes is well resolved even 
though the separation is only 42 mas. Right panel: the H-band correction on bright stars is so 
good that one can count up to 10 diffraction rings, some of which are slightly fragmented by 
residual uncorrected aberrations. Beyond this "dark hole" (see S ec. I2.3|>. the PSF brig htens 
because the very high orders remain uncorrected. Adapted from lEsposito et al. (2010 ) (courtesy 
of S. Esposito). 

These technical demonstrations illustrate the outstanding performance of current AO sys- 
tems. As such, they prompt the question that has prevailed through the early development 
of AO systems, "Here's what AO does, can we find a use for it?". In contrast, the real util- 
ity of such a capability is only demonstrated by its successful scientific applications - and 
AO observations are now leading to major new insights and advances in our understanding 
of the physical mechanisms at work in the universe. It is this astrophysical perspective, 
which is now driving the development of future and novel adaptive optics techniques, that 
we wish to emphasize in this review. By doing so, our aim is to motivate astronomers to 
ask instead, "Here's my science, how can AO enhance it?" 

To set the context, we begin in Sec. [2] with a brief description of a simple AO system, 
highlighting key issues related to the current generation of AO systems. We then turn to the 
astrophysical applications in Sec. [3] discussing a few examples drawn from a wide variety of 
fields. A common conceptual limitation of AO concerns the point spread function (PSF), 
about which some knowledge is required for most science applications. Sec. 2] discusses the 
different ways in which the PSF can be used for scientific analysis, the level of detail to which 
it must be known, and how it might be measured. While simple adaptive optics systems 
are already in use at many observatories, considerable effort is being made to demonstrate 
novel techniques that will vastly broaden their potential. Sec. [S] describes how these AO 
systems are being tailored to meet the specific requirements of diverse science cases. We 
finish in Sec. [6] by asking whether there are lessons to be learned that might guide future 
AO development, to further augment and broaden its scientific impact. 

2 Basic Adaptive Optics 

For centuries, astronomers have lived with blurry images of their targets when looking 
through ground-based telescopes. The images were blurred by the astronomical seeing 
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Figure 2: 



First astronomical AO corrected image obtained with COME-ON l|Rousset et al. 1990h . 



that originates from the light passing through the turbulent refractive atmosphere before 
reaching the Earth's surface. Similar to the effect when looking across a camp-fire, air 
cells of different temperature, density and hence refractive index introduce spatial and 
temporal variations of the optical path length along the line of sight. Astronomers try to 
mitigate these detrimental effects of the atmosphere by building observatories on mountain 
tops, and ultimately launching them into space. However, in 1953 the Mount Wilson and 
Palomar astronomer Horace W. Babock had a ground-breaking idea: "If we had the means 
of continually measuring the deviation of rays from all parts of the mirror, and of amplifying 
and feeding back this information so as to correct locally the figure of the mirror in response 
to the schlieren pattern, we could expect t o compensate both for the seeing and for any 
inherent imperfection of the optical figure. " (jBabcock 19531 ). It still took more than 30 years 
until technology had matured to a level supporting a practical implementation of Babcock's 
concept, now called adaptive optics (AO). While the US military started to invest in AO 
in the 1970s and had commissioned the first practical adaptive optical system, called the 
Compensated Imaging System, on the 1.6-m telescope on top of Haleakala on the island of 
Maui in 1982, it was in the late 1980 s that the first astronomical AO instrum ent COME-ON 
was tested at the 1.52-m telescope ( Merkle et al. 19891 . Rousset et al. 199ol . Fig. [2} of the 
Observat oire de Haute-Prov ence and later installed at ESO's 3.6-m telescope on La Silla 
in Chile (IRigaut et al. 19911 ). A thorough summary of the history of Adaptive Optics was 



presented bv lBeckers (19931 ) 
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2.1 The Principle of AO 

Atmospheric turbulence introduces spatial and temporal variations in the refractive index 
and the optical path length along the line of sight. Those variations mostly originate in 
the troposphere which typically extends up to a height of about 15 km and contains 80% of 
the atmosphere's mass. Large scale temperature variations produce pressure gradients and 
winds which lead to turbulent mixing of air with variable density and hence refractive index. 
These refractive inde x variation s show a very small dispersion between visible and mid- 



infrared wavelengths (|Cox 20011 ). so optical path variations are approximately achromatic 
over this spectral range. 

According to the Kolmogorov model, energy is injected and mostly contained at large 
scales and then dissipated to smaller scales which carry less and less energy. Hence, also the 
optical path variations carry most energy at small spatial frequencies, i.e., at large scales. 

The vertical distribution of the strength of refractive index variations through atmo- 
spheric turbulence as a function of height z is described by the continuous C\{z) profile. It 
is typically modeled by a number of thin turbulent layers of variable strength and height 
which move at different speeds in different directions (frozen flow hypothesis). In the near- 
field approximation the optical path variations introduced by each layer simply sum up. 
Amplitude or scintillation effects are neglected, which is a good approximation for most 
astronomical AO applications. There are three main atmospheric parameters that drive 
the design and performance of AO systems on telescopes up to 8-10-m class: 

• The Fried parameter, ro oc [\~ 2 (cosj)~ 1 J Cn(z)dz]~ 3 ^ 5 , gives the aperture over 
which there is on average one radian of root mean square (rms) phase aberration. 
If an AO system aims at achieving a moderate Strehl ratio of around 40%, it needs to 
correct on spatial scales of ro. Since ro oc A 6//5 , a good correction at longer wavelengths 
requires coarser sampling of the telescope pupil. Obviously, ro shrinks at larger zenith 
angles 7 and with increasing strength of refractive index variations, ro also happens 
to be the aperture which has about the same full-width at half maximum (FWHM) 
image resolution as a diffraction limited aperture in the absence of turbulence. This 
gives rise to astronomical seeing, for which FWHM ~ A/ro. ro is of the order 10 cm 
at visible wavelengths in 1" seeing. 

• The isoplanatic angle, 60 oc (cos^ro/h, with h denoting the characteristic height of 
the turbulence, describes the angle out to which optical path variations deviate by 
less than one radian rms phase aberration from each other. Given a certain correc- 
tion direction, do provides the maximum angular radius from this direction at which 
reasonably good correction is achieved. 80 is typically of order a few arcsec at visible 
wavelengths and strongly depends on the height distribution of the turbulent layers. 

• The coherence time, to oc ro/v, with v denoting the average wind speed, describes 
the time interval up to which optical path variations deviate by less than one radian 
rms phase aberration from each other, ro therefore defines the required AO temporal 
correction bandwidth, which is typically a few milliseconds at visible wavelengths. 

A fourth parameter that becomes increasingly important for larger telescopes - notably 
the future 30-40-m class extremely large telescopes (ELTs) - is the outer scale Lo, which is 
typically a few tens of metres although it can be larger. A wavefront that has propagated 
through the atmosphere does not de-correlate any further on size scales greater than Lo- 
This directly influences the performance of an AO system, most dramatically when Lo is 
comparable to the size of the telescope aperture. Thus, for the ELTs, Lo may become as 
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important as ro, 9a, and to for deciding which AO science programmes can be observed on 
a given night. 

The principle of AO is depicted in Fig. [3] The AO system tries to regulate the optical 
path variations (wavefront) by measuring the deviations using a wavefront sensor (WFS), 
calculating an appropriate correction, and applying this correction to a deformable mirror 
(DM). This feedback loop is carried out several hundred times a second in order to comply 
with the temporal bandwidth requirement set by to. The size of the resolution elements of 
the wavefront sensor (subapertures) and the deformable mirror (actuators) projected on the 
telescope entrance aperture should approximately match with ro. The depicted setup uses 
WFS measurements of a single guide star to correct the wavefront in its direction. Such a 
setup is most simple, widely used, and called Single-Conjugated Adaptive Optics (SCAO). 
SCAO suffers from image degradation over the field of view set by 9o- There is a wealth of 
other concepts involving multiple-guide stars and/or DMs as well as more complex control 
strategies which will be introduced in Sec. [5] 

Since AO wavefront sensing requires a light-source above the atmosphere and near to the 
astronomical object, it is very often photon starved, and a good sensitivity of the wavefront 
sensor is essential. In many cases, a visually bright enough natural guide star (NGS) is not 
available. While astronomical targets that are cold or obscured by dust may still be bright 
enough for wavefront sensing in the near-infrared (near-IR, 1-2.5 /im), the ultimate way to 
achieve a decent sky coverage is to create one's own guide star where needed. These laser 
guide stars (LGS) are introduced in Sec. 12.41 

The number of resolution elements for current AO systems at 8-10-m class telescopes 
correcting in the near-IR is typically a few hundred. In the standard approach, the com- 
plexity of a single wavefront calculation scales with this number squared, and the number 
of calculations per second is inversely proportional to ro. Hence, the complexity of an AO 
system roughly scales with r^ a or A~ 187/5 , so it is much easier to achieve a good correction 
at longer wavelengths. 



2.2 Key Components of an AO system 

2.2.1 Wavefront Sensing The objective of the WFS is to provide a signal with which 
the shape of the wavefront can be estimated with sufficient accuracy. It generally incor- 
porates a phase-sensitive optical device or sensing scheme and a low noise, high quantum 
efficiency, photon detector. Since the wavefront is nearly achromatic, wavefront sensing is 
typically done at visible wavelengths where detector technology - Charge- Coupled Devices 
(CCDs) or Avalanche Photo Diodes (APDs) - is most mature and state-of-the-art detectors 
have quantum efficiency near one and read-noise near zero. 

Three flavours of WFS are currently used in AO: the Pyramid WFS, Shack-Hartmann 
WFS and the curvature WFS. They all work with broad-band light, but differ in dynamic 
range and sensitivity. While the dynamic range is less important for closed-loop systems 
which are supposed to operate on small residual errors, the sensitivity (i.e. noise propagation 
properties) is the parameter that is to be traded against practical issues like technological 
feasibility. Fig. [4] shows the working principle of the three WFS mentioned above. 

The Shack-Hartmann WFS employs an array of lenslets across the aperture which pro- 
duce an array of spots corresponding to the local wavefront. The positions of these spots 
represent the average wavefront slope o r gradient over the subaperture. 

The Pyramid WFS (jRaeazzoni 1996h very much represents the Shack-Hartmann WFS 
when the pyramid (or prism in Fig. [4]) is modulated. When an aberrated ray hits the prism 
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Figure 3: 

Adaptive optics working principle (courtesy of S. Hippler). 



on either side of its tip, it appears in only one of the multiple pupils as displayed by the upper 
left panel of Fig. [4] The intensity distributions in the multiple pupil images are therefore 
a measure for the sign of the ray's slope. If the prism modulates, the ray will appear in 
either of the pupil images depending on the modulus of the local slope. Thus, the intensity 
distribution integrated over a couple of modulations also measures wavefront slopes in the 
pupil. The Pyramid WFS, however, offers the flexibility to adjust the modulation amplitude 
and hence the sensitivity of the sensor to the observing conditions. In the extreme case 
of no modulation at all, the Pyramid WFS very much behaves like an interferometer and 
measures the phase directly rather than its slope (jVerinaud et al. 20051 ). 

The curvature WFS measures intensity distributions in two different plan es on either sid e 
of the focus, corresponding to the wavefront's curvature or 2nd derivative ( Roddier 19881 ). 
Wavefront shapes with zero curvature (like tip, tilt and astigmatism) are measured through 
the beam circumference only. The beauty of the curvature WFS is its simplicity and ease 
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Figure 4: 

Schematic drawings of the three main WFS working principles. 



of use. Using an oscillating membrane in the focal plane, a truly differential measurement 
can be performed by a single pixel detector per subaperture, well suited for APDs, which 
h ave the same q uantum efficiency as a CCD but virtually zero read-out noise and delay. 
Guvon (2005) carried out a theoretical study of noise propagation properties of the dif- 



ferent WFS for high-contrast imaging applications. While a phase sensor like the non- 
modulated Pyramid WFS has equal sensitivity at all spatial frequencies, the sensitivity of 
a slope or 1st derivative sensor like the modulated Pyramid WFS or the Shack-Hartmann 
WFS degrades towards low spatial frequencies. This degradation is even more prominent 
for the curvature WFS where low spatial frequencies are only poorly sensed. The curva- 
ture WFS, however, has a better sensitivity than the Shack-Hartmann WFS at high spatial 
frequencies, a nd there are ways to cope with the poor low spatial frequency performance 



( Guvon 20071 1. Nevertheless, the high sensitivity and flexibility advocate the Pyramid WFS 
as the best choice for modern AO syste ms, a conclusion whic h is supported by the impressive 
recent on-sky results shown in Fig. [T1 ( Esposito et al. 20101 ) . 



2.2.2 Wavefront Reconstruction The topic of wavefront reconstruction addresses 
the calculation of an appropriate correction vector v (containing the voltages sent to the 
DM) from the WFS measurement vector s (containing, for example, all the slopes mea- 
sured by a Shack-Hartmann sensor). Since the WFS can be assumed to be operated in an 
approximately linear regime when in closed loop, wavefront reconstruction is described by 
the linear system 

Dv = s + n, 

with n denoting the measurement noise usually assumed to be Gaussian and uncorrelated, 
and D denoting the interaction matrix between DM and WFS. 

In order to solve for v, current AO systems derive a reconstruction matrix R and multiply 
it with s. In the most simple case, R is the inverse of D - or strictly, the Moore-Penrose 
pseudoinverse, since D is usually degenerate and not square and so not directly invertible. 
However, this simple approach normally leads to large noise amplification, so some kind 
of modal decomposition, filtering and weighting is involved in the calculation of R (e.g. 
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Gendron fc Lena 1994L IVerinaud fc Cassaing 200ll . IWallner 1983h 



Unfortunately such vector-matrix- multiply reconstructors scale in complexity with 0(n ), 
where n is the number of degrees of freedom of the system. Since a reconstruction must 
be carried out at each time step (~ 1 ms) of the control loop, and the delay between 
measurement and corrective action must be very small (~ one time step), the computa- 
tional load quickly becomes very demanding for the next generation of very high-order 
AO systems. Several approaches exist in order to reduce this complexity. For example, 



the FFT-based reconstructor (|Povneer. Gavel fc Br asc 2002) scales with 0(n log n) and 



has successfully been tested in the labora tory ( Povneer et al. 20081 ); the fractal iterative 



method (FRIM) jThiebaut fc Tallon 201oh scales with 0(n), but needs a few iterations of 



a conjugate gradien ts search algorithm to converge; the cumulative reconstructor (CURE) 



( Rosensteiner 20111 ) scales with 0{n) in a single step. 

Wavefront reconstruction and control can further be improved by predicting the system's 
state including the residual wa vefront error in a linear-quadratic-Gaussian (LQG) or Kal man 



filter based control approach (|Le Roux et al. 2004 . iPovneer. Macintosh fc Veran 20071 ) . In 



such a scheme, te lescope vibration s can also be incorporated in the state vector and effi- 



ciently corrected (jPetit et al. 20081 ). The drawback of such advanced control methods is 
again the added computational complexity, which can be mitigated by applying it to the 
most critical modes only, i.e. just to tip & tilt in the case of vibration filtering. 

2.2.3 Deformable Mirrors The objective of the DM is to correct for the optical 
path differences introduced by the turbulent atmosphere. It usually consists of an array of 
actuators which are connected to a thin optical surface that deforms under the expansion of 
the actuators. The most important parameters for a DM are stroke, response time, spacing 
and number of actuators. Spacing (projected onto the telescope's aperture) and response 
time should agree with the requirements set by ro and To, while stroke and number of 
actuators scale with aperture diameter. Assuming an infinite outer scale of turbulence Lo, 
the stroke increases with aperture diameter as d 5 ^ 6 according to the structure function of the 
Kolmogorov model. In reality, the structure function flattens out at an outer scale of order 
20-50 m, and stroke requirements are of the order of tens of microns optical path difference. 
While the largest DMs nowadays have some thousand actuators, AO at visible wavelengths 
at a 40-m telescope would require a DM with several ten thousands of actuators. 

Three main technologies are used to produce AO DMs (see Fig. [5]l . The largest are the 
adaptive secondary or deformable secondary mirrors (DSM). Replacing the static secondary 
mirror of the telescope, they provide AO correction while maintaining high transmission 
and low thermal emissivity by avoiding extra relay optics. The actuators can be voice coils, 
which are locally positioned by an internal control loop. They are typically separated by 
a few cm and attached to an optical shell, about 1 m in diameter but only 1-2 mm thick. 
Especially the polishing and handling of this shell is one of the great challenges in producing 
DSMs. The first such mirr or was installed in 2003 at the 6.5-m Mul tiple Mirror Telescop e 
(MMT) (jBrusa et al. 20031 ). Another one is in operation at the LBT fesposito et al. 20ld ): 
one is undergoing laboratory tests for the Magellan Telesco pe Jciose et al. 201p|) : and one 
more is in construction for the Very Large Telescope (VLT; lArsenault et al. 201oh . 

More common are medium size piezo DMs with an actuator spacing of several millimeters. 
They have less stroke than DSMs, but at about 10 /im peak-to- valley it is still sufficient for 
8-10-m class telescopes. In addition, piezo DMs have a significantly reduced response time 
of order a hundred microseconds. Since piezo DMs usually do not incorporate local position 
control and are affected by hysteresis and thermal expansion, they need to be controlled by 
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an AO system to provide a precise, stable wavefront quality. 

Quite recently, micro-optical-electrical-mechanical systems (MOEMS) have emerged as 
an alternative. They use electro-static or voice-coil actuation mechanisms and are produced 
using standard semiconductor fabrication technologies. With typical interactuator spacings 
of a few hundred microns, MOEMS are significantly smaller than other DMs. They have 
almost instantaneous response times, do not suffer from hysteresis, and are relatively inex- 
pensive to produce with many actuators. However, with a very large number of actuators 
as required by the next generation ELTs, channel routing and high actuator yield become a 
technological challenge. Some MOEMS as well as small spaced piezo DMs only have a small 
actuator stroke of the order ~ 2 /im. AO systems using such mirrors need to be operated 
with a second large stroke DM with fewer actuators a cting as a woofer that pre-flattens the 
wavefront to the required level I Hampton et al. 20061 ). 



2.3 Estimating Performance 

The performance of an AO system can be evaluated using a number of different criteria 
or metrics. These metrics have been chosen to be relevant for particular science cases. A 
"good" AO system for hunting exoplanets may not be useful for extragalactic science, and 
trade-off studies during a conceptual design must take into account the desired performance 
for the intended applications. Probably the most common performance metric for AO is the 
residual wavefront error variance o^fe over the telescope aperture. Another very com- 
mon performance metric is the Strehl ratio S. These two metrics are, however, somewhat 
redundant since they are linked through the Marechal approximation: 

5* ~ e ~ a WFE_ 

Ross (2009) showed that this approximation holds exactly in the common AO case of Gaus- 
sian phase residuals, and it is still a very good approximation for many other distribution 
laws. The residual wavefront error variance can be split up into individual contributors 
assuming that those are uncorrelated: 

o~wfe = "fit + &lec + &bw + other. 

The main error contributors are i) the fitting error a 2 f it which gives the residual high spatial 
frequency wavefront error which the DM, with its finite number of actuators, cannot fit 
anymore; ii) the reconstruction error a^ ec which combines all effects that reduce wavefront 
reconstruction accuracy (measurement noise, calibration noise, sampling errors, aliasing, 
chromaticity, etc.); and iii) the temporal bandwidth error of w which accounts for the finite 
response time of the AO system to the dynamic turbulence. The latter two terms are 
within the control space of the AO system. A good AO concept properly balances the error 
terms in order to avoid excessive specifications of some components that do not present a 
benefit to the overall performance. Depending on the application, there may be other error 
sources in addition to those given above. For example, observation of an object that is not 
coincident with the AO guide star will introduce angular anisoplanatic error, and using a 
single laser guide star (see Sec. 12. 4|) will lead to the cone-effect or focal anisoplanatic error. 

High spatial frequency wavefront errors degrade many types of science more than lower 
frequency ones, because they scatter the light far away from the image center. Low spatial 
frequency errors instead leave most light concentrated in the vicinity of the center, and 
the corresponding loss could be compensated with slightly bigger pixels without a large 
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Figure 5: 

Main technologies used to produce AO DMs. Upper left: Lab testing of the LBT adaptive 
secondary mirror system with 672 actuators (Photograph by R. Cerisola). Upper right: A 
4096-actuator micro-optical-electrical-mechanical-system (MOEMS) deformable mirror 
(Photograph courtesy of S. Corn elissen, Boston Micr omachincs Corp.). Bottom: Piezo high-order 
1377 actuator DM for SPHERE l|Sinquin et al. 200gf ). 



penalty on sensitivity and resolution. This reasoning led to the encircled energy metric 
which defines the radius in which a certain fraction (typically 50% or 80%) of the total light 
is concentrated. The overall budgeting of multiple error sources is however difficult since 
the encircled energy cannot easily be written as the sum or product of individual terms like 

the residual wavefront variance above. 

The missing budgeting capability of the encircled energy led lSeo et al. (20091 ) to propose 
the normalized point source sensitivity (PSSN) metric. The PSSN for AO characterization 
is defined as the ratio of the sum of an observed point source image intensity ipgp squared 
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over that of a perfect imaging system working at the diffraction limit: 



PSSN : ^ Plx ^ SF ' ° bs 



V I 2 
L-, V ix PSF, perf 

Similar to the Strehl ratio, the PSSN is equal to one for perfect AO performance and 
approaches zero when residual errors are large. It is also approximately multiplicative for 
multiple error terms as long as those are low spatial frequency and weak, i.e., the total 
PSSN is the product of the individual PSSNs of each error term. 

A somewhat special case is presented by the high-contrast imaging of faint companions 
and exoplanets around bright stars. Here, the objective of the AO and wavefront control 
system is to produce a very good correction performance or residual wavefront variance as a 
function of spatial frequency. This is because the residual i mage intensity is proportional to 



the residual spatial power spectrum of the wavefront (e.g. Guvon 2005 . Perrin et al. 2003 ) 
when aberrations are small. A DM can suppress aberrations up to its correction radius 
ao = A/ (2d), where A is the wavelength of the observation and d is the interactuator 
spacing projected back to the telescope aperture. As such, it must create a "dark hole" in 
this region (for example, as seen in the right panel of Fig. [T]) that is void of scattered stellar 
light, and in which one could search for faint companions. 



2.4 Laser Guide Stars 

For a reasonable correction performance in the near-IR, AO systems need sufficiently bright 
(~ 15 mag) guide stars within 0q of the astronomical target. This requirement strongly 
limits the sky coverage that can be attained using natural guide stars (NGS); only about 
10% of the sky can be observed with AO on average while there is a large difference between 
the galactic plane (severa l tens of percent) and the galactic pole (a few tenths of a percent) 
( Ellerbroek fc Tyler 199sl ). Especially the very low sky coverage at high galactic latitudes 



hinders extragalactic astronomy from using AO efficiently. 

It was the US military, in a classified programme, that first proposed to overcome this 
problem by u sing laser beacons to create artificial sources (laser guide star, LGS) (see 
Beckers 19931 ). A f ew years later the con cept was independently proposed in an astro- 



nomical context by iFov fc Labevrie (1985). The two mechanisms considered so far are 



i) Rayleigh scattering in the dense regions of atmosphere up to altitudes of about 30 km 
above the ground, and ii) resonance fluorescence of sodium atoms which are concentrated in 
a layer at about 90 km height. The first succe ssful tests of a Rayleigh LGS were performed 
at the Starfire Optical Range 1.5- m telescope (Fugate 19921 ): the first astronomical LGS AO 



systems were installed at the Lick (|Max et al. 19971 ) and Calar Alto l|Eckart et al. 200oh ob 



servatories in the mid 1990s. Still it took another ten years until th e technology had mature d 



enou gh to be installed at 8-10-m class telescopes such as Keck II ( Wizinowich et al. 2006 \ 



VLT ( Bonaccini Calia et al. 20061 ). Gemini North (jBoccas et al. 20061 ). and Subaru ( Havano et al. 2010l ) 



Most of the LGS AO systems mentioned above use sodium laser guide stars generated by 
lasers emitting light of 589 nm wavelength to excite sodium D2 lines. One of the greatest 
technological challenges has been the generation of the laser light itself. Dye lasers were a 
common choice for early systems, but these are bulky (see top left of Fig. [6}, inefficient and 
sensitive to environmental conditions. Dye LGS therefore require high maintenance and 
preparation time before an observing night and produce significant operation overheads. 
Solid state sum-frequency lasers, which may be the best option for a pulsed laser beam, are 
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Figure 6: 

Left: VLT LGSF dye laser PARSEC (Ratten c t al. 20021 , picture credit ESO/H. Zodct) with 
> 10W output power. Right: Solid state sum-frequency laser for the Gemini South MCAO 
system (courtesy of C. D'Orgeville). Bottom: Raman fiber laser prototype (picture credit ESO) 
with 20 W output power. 



used at some observatories and offer a significant improvement to this situation. They are 
also currently the most powerful sodium line lasers available. T he Gemini South MCAO 



system, for example, uses a single 50 W laser to produce its 5 LGS (|D'Orgeville et al. 2011 



The maintenance issues of both dye and sum-frequency l asers now appear to be solved by the 
ruggedized and compact Raman fiber laser technology ( Bonaccini Calia et al. 20101 ) which 
has dem onstrated a 20 W outp ut power. It will generate the LGSs for ESO's adaptive optics 
facility ( Arsenault et al. 20101 ) . and is planned to be used for the European ELT. 

LGS also possess considerable drawbacks with respect to NGS. First, due to the finite 
distance between telescope and LGS, the backscattered beam does not sample the full 
aperture at the height of the turbulent layers. This cone effect or focal anisoplanatism is 
more severe for larger apertures and higher turbulent layers. Second, an LGS cannot be 
used to measure tip and tilt of the wavefront since the contribution of the upward projection 
jitter cannot be disentangled from the measurement. Hence, an NGS is still required, but 
the reduced dem ands on its brightness and distance to the target lead to a ten-fold increase 
in sky coverage ( Ellerbroek fc Tyler 199sh . 
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ELTs offer an interesting alternative to using LGSs, if the AO system operates in open 
loop. This is a very different mode of operation from typical closed loop AO systems, where 
the deformable mirror is in the optical path between the incoming light and the wavefront 
sensor so that one is only measuring the residual wavefront error. In these systems, if the 
AO is functioning well, the measured residual wavefront will be close to zero. In contrast, 
with open loop control, the full wavefront error is measured in each cycle and applied to a 
DM that corrects the path over a restricted field in one or more specific directions. There is 
thus no direct feedback about how well the wavefront in the science field is being corrected. 
With increasing telescope diameter, the beam overlap in the highest turbulent layers also 
increases and the angular separation of stars to sample the at mosphere can be larger. In an 
application of the multi-object AO technique (see Sec. l5.3.2jl . lRagazzoni (201ll ) argues that 
for 40-m aperture and 10' search area, the probability to find at least three stars which are 
bright enough for wavefront sensing (V~15mag) and reasonably well sample the highest 
layers over a 2' scientific field approaches 100%. Tomographic reconstruction and open loop 
control would then allow for nearly all-sky coverage using only NGS. 



3 Astrophysical Applications of Adaptive Optics 

During the last decade, simple SCAO using natural and laser guide stars, has emerged from 
being a niche technology and is beginning to have an impact in mainstream astrophysics. 
In this section we focus on a few examples drawn from what is now a vast AO literature, 
to illustrate how it has enabled a deeper understanding of the physical processes at work 
across a vast range of scales and epochs throughout the universe. 



3.1 Sun and Solar System 

3.1.1 The Sun One of the biggest successes of adaptive optics has been in driving 
forward our understanding of the sun's photosphere. This complex layer is the interface 
between deep regions where turbulent convection dominates the dynamics, and the mag- 
netically controlled chromosphere and corona above. It is characterised by granulation 
from hot upflowing convective cells which sweep the uplifte d magnetic field into the d ark 
cooler downflowing lanes between them. As pointed out bv lRimmele fc Marino (20111 ) in 
their review, AO systems, where they have been installed on the current generation of 1-m 
class solar telescopes, are used for the vast majority of observations. The scientific impe- 
tus for developing the next generation of 4-m class solar telescopes is entirely due to the 
enhanced performance afforded by AO. Solar adaptive optics faces different challenges to 
those of night-time astronomy. Observations are, by definition, performed during the day, 
and often at high airmass. In addition, the wavelengths of scientific interest are typically 
in the optical regime (e.g. the G-band at 430 nm). All these make good AO performance 
hard to achieve. Yet the wavefront sensing is accomplished using the low contrast, time 
varying, granulation of the sun's photosphere. As such, the spots neede d for a standar d 
Shack-Hartmann sensor are created using a cross-correlation technique i Rimmele 2000h . 
The reason that adaptive optics is able to have such an impact on solar physics is be- 
cause the large scale structures are intimately linked to the small scale dynamics. The 
two scale lengths that determine the structure of the solar atmosphere, the pressure scale 
heigh t and the photon mean free path, are both of order 70 km, corresponding to about 
0.1" ( Sankarasubramanian fc Rimmele 20081 ). the optical diffraction limit of 1-m class tele- 
scopes. Amongst the largest solar telescopes currently operational is the 1.6-m clear aper- 
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ture telescope at Big Bear Solar Observatory. The first results with its 76-aperture AO 
system reached a resolution of 0.12 at 706 nm (|Goode et al. 20icl ). The data show that, in 
contrast to expectations and previous lower resolution observations, the smallest scale mag- 
netic fields along the inter-granular lanes appear to be isolated points. Their long lifetimes 
of several minutes suggest they are anchored deep beneath the photosphere. Thus, as the 
granules on either side collide, they are forced into cyclonic flows. The resulting twisting of 
the magnetic field may be an important way to transfer energy upwards from a significant 
depth. 



3.1.2 Asteroids At the opposite end of the solar system's mass range, AO is showing 
that massive asteroids are surprisingly porous, with implications on their formation mech- 
anism. The best method to measure the mass, and hence bulk density, of an asteroid is 
to observe the orbit of a 'moon' around it. Today, about 150 main belt binary asteroids 
are known, and AO - using the asteroids themselves as the wavefront references - is a 
key diagnostic for studying these systems. The first was discovered using the PUEO AO 
system on the Canada- France-Hawai'i Telescope (CFHT): the 4.7 day orbit of Petit Prince 
around 45 Eugenia reveale d that the density of Eugenia is only ~ 20% greater than that of 
water ( Merline et al. 19991 ). AO on the VLT led to the confirmati on that some asteroi ds are 
multiple systems, with the discove ry that 87 Silvia has 2 moons (|Marchis et al. 20051 ). In a 
larger survey using AO on Keck II. lMarchis et al. (2006bT ) imaged 33 V < 13 mag main belt 
asteroids, achieving 2 /im resolutions of 50-60 mas, corresponding to < 100 km at a typical 
distance of 2 AU. One remarkable system, 216 Kleo patra, is highly elongat ed in a 'dog-bone' 
shape, and also has 2 moonlets about 6 mag fainter (|Descamps et al. 20111 ). Its bulk density 
of 3.6gcm~ 3 is a factor of two lower than that of pure iron, and of the spectroscopically 
estimated surface grain density of its meteoritic analogue, implying that the asteroid is 
rather porous. 216 Kleopatra may therefore have formed as matter re-accumulated after 
a catastrophic impact disrupted a parent asteroid; its loosely packed material means that 
as it spun up, it became elongated, analogously to a spinning liquid mass; and that the 
satellites result from shedding of some of this material. These authors have analysed a 
number of other weakly bound aggregates in similar detail using adaptive optics also on the 
VLT and Gemini North. Among these is 617 Patroclus, which comprises 2 similar compo- 
nents separated by 680km with a bulk density of only 0.8 gem -3 . Its angular momentum 
i s high enough to not only have elongated the body but to split it into the observed binary 



(jMarchis et al. 2006al . iMerline et al. 2001 ). These results, de rived from applying AO imag 



ing at multiple epochs, have led iDescamps fc Marchis (20081 ) to propose such 'rubble-pile' 
models as a general evolutionary scenario for these asteroids. 



3.1.3 Planets and their Satellites The planets, with angular diameters of a few 
arcsec to an arcmin, are obvious targets for adaptive optics. Ground-based observations 
of Jupiter and Saturn (jGlenar et al. 19971 ) using LGS at the U SAF Starfire Optical Range, 
and of the three ring arcs around Neptune ( Sicardv et al. 19991 ) using PUEO at the CFHT, 
were among early successful applications to planetary science. AO is particularly suited to 
studying the atmospheres of planets and their satellites, since different spectral bands are 
able to probe changes in the distribution and abundance of a variety of molecules and ices to 
different depths. One frequent target is Titan, the only satellite to have a dense atmosphere. 
It has now been observed for more than a decade with a variety of AO systems, spatially 
resolving its 0.8" diamet e r face and tracking seasonal and diurnal changes in its atmosphere 
( Adamkovics et al. 20071 . Illartung et al. 20041 . Ikirtzig et al. 2006t ). 
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Figure 7: 

Multi-wavelength imaging using AO on 8— 10-m class telescopes (often together with HST optical 
imaging) is central to many science analyses. Here this complementarity is applied to Jupiter, 
using AO observations from 1 )im to 5 /im. The left panel shows a number of southern ovals at 
1.3—1.6 mum wavele ngths. The right pane l shows that at 5 fim the smaller ovals are surrounded by 
rings. Adapted from lde Pater et al. (201(ih (courtesy of I. de Pater). 



Affording a special technical challenge to AO are observations of a planet or satellite while 
using another as the guiding reference, because of their different non-sidereal motions. One 
example is fo, which has been observe d in Jupiter's shadow to highlight its volcanic activity, 
with Ganymede as the AO reference (|de Pater et al. 2004 ). Jupiter itself is another exam- 
ple. Although such observations are complex, they enable one to obtain data across a vast 
wavelength range at similar spatial resolution, from optical space-based imaging with the 
Hubble Space Telescope (HST) to near and mid-infrared ground-based AO on 8-10-m class 
telescopes. This provides a view of the reflected sunlight as well as Jupiter's own thermal 
emission, enabling one to probe to greater depths through the atmosphere. One remark- 
able result, illustrated in Fig. [7J is that the oval storm systems are typically surrounded 
by bright rings at 5 ^tm I de Pater et al. 2O10l ). The 225-250 K brightness temperature of 
these rings suggests they are cloud-free down to a depth at which the pressure reaches 
at least 4 bar. Radiative transfer modelling of the 1-2/im data shows that the ovals have 
similar vertical structure, the main differences between them being the particle densities 
in the tropo- to stratospheric hazes. These observations have led to a model in which the 
ovals are anticyclones where air is rising at the ce ntre up to the tropop ause at a pressure 
of ~ 0.1 bar, and descending around the periphery ( de Pater et al. 2010l ). Importantly, the 
downflows cannot exist at radii greater than 1-2 times the Rossby radius, where rotational 
effects become as important as buoyancy effects (i.e. where the coriolis force has turned the 
velocity vector by 90°), about 2300 km at the latitude of the observations. This is why the 
12000 km diameter Oval BA does not exhibit a 5 /im ring: instead the downflow occurs in 
the outer red part of the oval. For the much larger Great Red Spot, the size of the system 
means that the Rossby radius, and hence the downflow, is located more towards the centre. 



3.2 Star Formation 
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3.2.1 Stellar Multiplicity Using adaptive optics to assess the multiplicity of stars 
is an obvious science goal, but one that is difficult to achieve: it requires a large well 
selected target sample, and very careful bias and sensitivity corrections that depend on 
the performance of the AO system as well as the magnitude and separ ation of potentia l 
companions. AO offers clear advantages over the earlier speckle imaging (|Ghez et al. 19931 ) 
by enabling one to study higher contrast systems as well as fainter objects. In the former 
case, AO has been cardinal in searches for low mass compani ons in the vicinity of higher 
mass stars, from an early survey of O B stars using ADONIS JShatskv fc Tokovinin 2 002) 
to observations around solar analogues (jMetchev fc Hillenbrand 20091 ). In the latter case, a 
key role for AO has been to resolve very low mass binaries (Mtot < O.I85M0) in the near-IR 
where these cool objects emit most of their luminosity, and where AO works well. Multi-year 
studies of their orbits have allowed accurate masses and luminosities to be measured; which 
in turn has led to calibration and verification of their ev olutionary mass, luminosity, and 
age tr acks ( Close et al. 20071 . IZapatero Osorio et al. 20041 ) . Intriguingly, while theoretical 
tracks ( Chabrier et al. 2000l ) are a reasonable match to the data for low mass stars, applying 
the same atmospheric models to even lower masses (< 13 Mj) fails to predict the very red 
low luminosity objects now known as giant exoplanets (see Sec. 13. 2. 311 



O ne of the largest AO surveys of very low mass stars was carried out bv lClose et al. (20031 ) 
and ISiegler et al. (2003h . who used the Hokupa'a AO system on Gemini North to survey 
69 stars of spectral type M6.0 to L0.5. They found 12 systems with very low mass or 
brown dwarf companions, yielding a sensitivity corrected binary fraction of order 10%. The 
pairs in each binary have similar masses, and their separations are only a few AU (with 
none beyond 15 AU). Remarkably, these characteristics differ significantly from the slightly 
more massive G dwarfs for which the binary fraction is around 50%, and that exhibit a 
wide distribution of separations cen tered around 30 AU. An a dditional discrepancy be tween 
the populations was pointed out by Dupuv fc Liu (2011 ) and Konopackv et al. (20ld ) who, 
both using AO on Keck II, compiled orbital data for 16 and 15 very low mass and brown 
dwarf binaries respectively. Both studies highlighted a preponderance of almost circular 
orbits, and find at best only a marginal correlation between eccentricity and period. The 
favoured model to explain t hese popula tion differences, for which recent hydrodynamical 
simulations are reported by Bate (20091 ). suggests that older (~Gyr) field brown dwarfs 
are systems that have been ejected at speeds of a few kms -1 from the cluster in which 
they formed. For very low mass stars, only the most tightly bound systems survive such 
a velocity kick, resulting in both low multiplicity and small separations, and favouring 
low eccentricities. While some discrepancies between observations and simulations re main, 
the global view i s compelling. In a new twist to this picture, IClose et al. (2007al ) and 
Biller et al. f 20111 ) find that young (< 10 Myr) very low mass and brown dwarf binaries 
can have much wider separations, beyond even 100 AU. The proposed explanation is that 
the survival time of a binary system depends on how tightly bound it is, as well as the 
stellar density in its local environment. Wide binaries seen at young ages are likely to 
be disrupted by stellar interactions, and so are absent from the population of older field 
binaries. Confirmation that a high multiplicity is established early on comes from AO 
observations s howing that 30-50% o f embedded protostars are binaries with separations up 
to ~ 1000 AU (jDuchene et al. 2007) and that many close protostar pairs may have formed 
via ejection of a third star ( Connellev. Reipurth fc Tokunaga 20091 ). 



3.2.2 Circumstellar Disks It is the disks around young stars that can shed light 
on how binary stars and planetary systems form. The interest and importance of this 
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topic is reflected in the huge activity over the last decade in studying disks around stars 
of all mass ranges, stimulated by the advent of high resolution imaging and interferometry. 
While AO has not been the key driver in this field, it has added a new aspect by providing 
near- and mid-infrared data at the same resolution as optical HST images. The resulting 
multi-colour data make it possible to probe the size and structure of the dust grains and 
hence infer how the disk evolves. The first circumstellar disk observed with AO was /3 Pic 
( Golimowski et al. 199^ ). and numerous subsequent AO observations have revealed warps 
and sub-structure in this disk on 1 AU scales (Fig. [8] left). T he binary T Tauri s ystem 
GG Tau A-B (Fig. [8] right) was also the focus of early studies jRoddier et al. 19961 ). AO 
imaging with Keck II of its 3.8 pm scattered light, combined with optical HST data, has 
indicated that the dust in the ring may be stratified, with th e larger grains either settling 
towards the midplane or preferentially growing there faster ijDuchene et al. 2004 ). How- 
ever, it is not clear h ow universal the se processes are sinc e AO observations of HV Tau C 
( Duchene et al. 2010l ) and HK Tau B (jMcCabe et al. 2011T ) show that, despite their similar 
ages, these 3 disks exhibit differing dust properties, which implies differing growth and/or 
settling times for their constituent grains. Complementary to work based on the scattering 
phase function of grains are direct measurements of absorption from specific molecules. Us- 
ing the MMT adaptive secondary AO system to obtain spatially re solved spectroscopy o f 
the 10 Lira silicate feature in each component of nine T Tauri systems, ISkemer et al. (2011bl ) 
pointed out that, despite the large variation among different systems, shared properties 
within a system likely play an important role in dust grain evolution. In a more massive 
system, detection of water ice via its 3.08 /im absorption in the disk around the Herbi g 
Ae star HD 142527, was achieved using AO coronagraphy on Subaru ijHonda et al. 20091 ). 
While water ice is believed to promote the formation of cores in protoplanetary disks, this 
observation is perhaps the most direct evidence confirming its existence. 

Returning to GG Tau, there are intriguing hints that its disk may be tilted with respect 
to the orbit of its system. Astrometric data obtained with AO on the VLT shows that 
the abrupt inner edge of the di sk cannot be due to the stars unless their orbit is tilted by 
~ 25° with respect to the disk (jKohler 20111 ). The only way to avoid this conclusion is to 
hypothesize the existence of a companion. Many circumstellar debris disks do show signifi- 
cant sub-structure, which is believed to be driven by orbiting c ompanions, and can provide 
insights into the mechanisms of planet formation ( Wvatt 20081 ). Indeed AO, together with 
non-redundant aperture masking for which the interferometric analysis leads to superior 
PSF calibration and speckle suppression, is now provi ding evidence tha t the central holes 
in the disks a re due to the presence of a binary star (Kraus et al. 20*121 ) or are carved by 
giant planets ( Huelamo et al. 2011 . Kraus fc Ireland 20121 ). 



3.2.3 Extrasolar Planets Direct imaging of exoplanets is a topical high prior- 
ity science goal and one of the drivers of future ELTs. It requires extremely high con- 
trast (> 10" 9 ), long exposure, coronagraphic imagi ng, combined with very careful control 
and characterisation of the residual speckle pattern 1 Qppenheimer fc Hinklev 20091 ). These 
lead to demanding requirements for the AO, as well as for the associated instrumenta- 
tion and post-processing techniques (see Sec. 14.1.41 and I5.4|). Although a fe w exoplanets 



have now been imaged d irectly, a number of surveys - e.g. Biller et al. (20071 ) (45 targets) 



Lafreniere et al. f 2007a ) (85 targets), and Kasper et al. (2007t ) (22 targets) - have failed 



to 



detect any planets down to limits of a few Mj. 

The first una mbiguous confirmat ion by proper motion analysis of a multi-planet system 
was reported bv lMarois et al. (20081 ). HR 8799 is a 1.5 Mq A5V star at a distance of 39.4 pc 
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Figure 8: 

AO images of circumstellar disks. Left: composite image of /3 Pic. The outer part shows the dust 
disk observed in the K-band with ADONIS; the central part is an L-band image from NaCO on 
the VLT which shows a companion 0.4" (8AU) to the north-west of the star (image credit: 
ESO/A.-M. Lagrange et al). Right: H-band image of light reflected by the disk around GG Tau, 
obtained with Hokupa'a on Gemini North, confirming the presence of a gap in the disk at a 
position angle of 270°. The locations of the binary stars are indicated, (image credit: Daniel 
Potter/University of Hawaii Adaptive Optics Group/Gemini Obscrvatory/AURA/NSF). 



with an age of 30-160 Myr. It is now known to have 4 planets at di stances of 14-68 AU 
and masses in the range 7-10 Mj, which all orbit in the same direction ( Marois et al. 201ol ). 
But the existence of gian t planets at such a wide range of distances, as well as their low 
luminosities, is a puzzle ( Close 20101 ). While the outer 3 could have been formed via the 
fragmentation of a disk of gas and dust, at the distance of the innermost planet the disk 
would have been neither cold enough nor rotating slowly enough for this mechanism to 
work. The alternative process is via the agglomeration of grains. This has been suggested 
for the ~ 9Mj planet /3 Pic b, also discovered by AO imaging, which orbits a 1.8 M© 
star at a distance of 8-15 AU jLagrange et al. 2010T ) (Fig. [8)). However, for HR 8799 d, 
it is barely fast enough to form the planet before the matter is accreted onto the star 
itself, and certainly could not have formed the outer planets. A more rapid growth rate 
could be achieved if the surrounding disk were dynamic ally cool, which would require it 
to consist of Pluto-mass planetesimals ( Currie et al. 201ll ). While this is not impossible, it 
is certainly considered an infrequent phenomenon. In terms of their low luminosities, the 
favoured explan ation - which was suc cessfully applied to the first exoplanet directly imaged, 
2MASS 1207 b jchauvin et al. 2005! ) - is that very t hick high latitude clo ud bands absorb 
and scatter light when the planet is viewed pole-on (jSkemer et al. 2011a! ) . Observational 
confirmation of this conjecture for HR 8799 is still at an early sta ge but it is a likely scenario 
if the clouds are thicker than found in brown dwarf atmospheres (|Currie et al. 20111 ). Given 
such a status quo, the next generation of planet imagers is eagerly awaited. 
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3.3 Resolved Stellar Populations 

Another major goal of ELTs is to spatially resolve stellar populations in nearby galaxies, in 
order to trace their star formation histories (ages, metallicities) by mapping the loci of indi- 
vidual stars on colour magnitude diagrams (CMD) l|Deep et al. 201ll : loisen. Blum fe Rigaut 20031 ). 
Adaptive optics is a natural technical solution to the problem of crowding in these excep- 
tionally dense stellar fields, the limiting factor in such work. However, the better diagnostic 
power of CMDs at shorter (optical) wavelengths conflicts with the increasing performance 
of AO systems at longer (near-infrared) wavelengths. This has led to the requirement 
to enhance AO at shorter (0.7-0.8 /xm) wavele ngths as well as t o develop new te chniques 
for estimating the ages of stellar populations I Bono et al. 201ol Deep et al. 201ll ). These 
obstacles mean that AO on 8-10-m class telescopes currently struggles to be competitive 
with optical space-based work. Despite this, some progress has bee n made probing the 
stellar ages and star formation rates in the bulg e and disk of M31 (Davidge et al. 2 005. 



Olsen et al. 20061 ) and the nearest dwarf galaxies ( Melbourne et al. 201C ) . 



The technique is highly effective when applied to galactic star clusters, notably yield- 
ing insights on the hotly debated topic of whether or not there is a universal initial mass 
function (IMF, defined as a power-law with slope V: dN / dl oqM oc M~ T , with a standard 
Salpeter value of T — 1.35) ( Bastian. Covey fc Meyer 2010h . NGC3603 plays a role here 
because it is one of the most massive and densest star forming clusters in the Galaxy, and is 
often considered to be a local template for the massive star cluster s found in starbursts and 
exte rnal galaxies. Using AO to mitigate the crowding problems, lEisenhauer et al. (1 998) 
and lHaravama. Eisenhauer fc Martins (20081 ) probed the stellar population to sub-solar 
masses, far lower than achieved previously. Their intriguing result is that, although at 
higher masses the IMF slope is consistent with the standard value, at M < 15 Mq the slope 
has F = 0.74 indicative of a top-heavy IMF. Although the mass segregation in NGC3603 
- evide nt through a flattening of the IMF at ra dii < 5" (0.15pc) - may complicate the 
picture, lHaravama. Eisenhauer fc Martins (20081 ) found no evidence for a steepening of the 



IMF slope above F ~ 0.9 at any radius out to 110" (3pc). However, while the IMF is 
distinguishably flatter than Salpeter in NGC 3603 (and also in the Galactic Centre clus- 
ters), there are other massive star forming clusters which do not exhibit such an effect 
( Bastian. Covey fc Meyer 2O10l ). Thus, the cause of such variations, and the fundamental 
question of whether the IMF does systematically vary with environment, is still open. 

The ability to image stellar fields at high angular resolution leads directly to a second 
scientific application: measuring the proper motions of stars, and hence deriving the internal 
kinematics of clusters or galaxies, as well as their global motion. If the sources of error can 
be controlled to a sufficient level, then AO astrometry will become a major capability of 
ELTs, yielding a pr oper motion precis ion of lO^iasyr - after only 3-4 yrs, equivalent to 
5 kms -1 at 100 kpc (jTrippe et al. 20ld ). Such techniques have already been applied to the 
Arche s Cluster, which is only 30 pc from the Galactic Center, using data from NaCo on the 
VLT ( Stolte et al. 20o"8i ). Observations over a 4.3 yr baseline show that this young cluster 
has moved by 24.0 ±2.2 mas with respect to the field. Including its line-of-sight motion, the 
3D velocity is 232 ± 30 kms" 1 with a trajectory that rules out suggestions that it could be 
a rejuvenated globular cluster. Instead, the Arches Cluster may be transitioning between 
the xl and x2 orbit families associated with the Galaxy's barred potential. 



Adaptive Optics for Astronomy 19 



Figure 9: 



Left: adaptive optics imaging of the Galactic Center in H, K 3 and L bands (blue, green, and red 
respectively). The location of Sgr A* is marked by the 2 yellow arrows. Right: astrometric 
measurements of the star S2 (or S02) over a period of nearly 20 years has tracked more than one 
orbit around Sgr A* (the grey crosses mark the location of the near-infrared flares, very likely 
coming from within ten times the Schwarzschild radius of the massive black h ole ). In the same 
coordinate system the Keck and VLT astrometry is consistent. Adapted from lGenzel et al. (201oT ) 
(courtesy of S. Gillcsscn). 



3.4 The Galactic Center 

The Galactic Center (GC) provides an exquisitely detailed view of the physical processes 
occuring in the nucleus of our Galaxy and around its central massive black hole (BH), which 
can be directly applied to the nuclei of other galaxies. It is a rapidly advancing field in which 
spatially resolving the stellar populations is of particular interest. The severe crowding 
means that AO plays a central role despite the fact that the GC is highly obscured and has 
no optically bright guide stars. Instead, because a K ~ 6 mag star lies just 5.5" away, it is 
a primary science driver for infrared wavefront sensing. C urrently, NaCo on the VLT is the 
only camera with an infrared WFS (jltousset et al. 20031 '). In contrast, when observing the 



GC from Keck II, a laser guide star must be used; that the GC never rises above 41° from 
that latitude adds to the difficulty of obtaining high resolution data. But observations from 
both observatories have been highly successful. A complete rev iew of recent observ ational 
and theoretical progress on the central parsec is presented by iGenzel et al. (2010l ). The 



main advances that have been made using AO are summarised below. 

The ~ 40 mas resolution that can be achieved in the H-band on 8-10-m class telescopes 
(Fig. [9] left) has enabled the stellar distribution to be traced to scales well below 0.04 pc 
(1" at the distance of 8kpc) from Sgr A*. Although the cusp maximum is centered on 
Sgr A*, the radial distribution of different stellar types varies consid erably; and much of the 



increase in the central l" is due to a high concentration of B stars ([Bartko et al. 2 010) 



Analysis of the stellar motions (Fig. |9j right) , afforded by 150-300 /xas astrometry ( Fritz et al. 20ld . 
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Ghez et al. 20081 ). shows that while the vast majority of stars in the central parsec are old 
and have randomly oriented orbits ( Yelda et al. 20101 ). about half of the young stars in the 
central 10-15" are confined to a warpe d clockwise disk, while many of the remainder may 
be in a second counter-clockwise disk ( Bartko et al. 20l"ol . Lu et al. 20091 ). This has put 
strong constraints on the mechanism of the star formation event(s) 6Myr ago that gave rise 
to these stars. 

A combination of astrometry and line-of-sight velocities has enabled the full 3D orbits of 
abo ut 30 stars to be determined, showing b eyond reasonable doubt that Sgr A* is a massive 
BH |Ghez et al. 20081 . iGillessen et al. 20091 ). The combined Keck II and VLT data sets are 
yielding ever more precise measurements of the distance to the GC and mass of the BH, 
for both of which the systematics associated with the dista nce is now the dom inant error 
term. The current best values are 8.3 kpc and 4.3 x 1O 6 M (|Genzel et al. 2010l ). 

Light curves of near-IR flares from the accretion flow towards Sgr A* are now regu- 
larly measured. Statistical analysis of their frequency, brightness, and variations indicate 
that there may be two types: a power-law distribution of occasional bright flares showing 
substructure, superim posed on a lognormal distribution of faint continuous variability char- 
acterised by red noise l|Po et al. 20091 . iDodds-Eden et al. 2011T ). Such data, combined with 
spectral index and polarisation measurements as well as simultaneous multi-wavelength 
observations are pr oviding detailed constraints on the physical models of flare emission 
( Genzel et al. 201ol ). 



3.5 Galaxy Nuclei and Active Galaxies 

3.5.1 Black Hole Masses Our understanding that the growth of supermassive black 
holes is tied to that of their host galaxies is founded on measurements of BH masses. 
These have led to the emergence of relations between the BH ma ss and the velocity dis- 
persion, mass, and luminosity of the ste llar spheroid around it (jFerrarese fc Ford 2 005. 
Gebhardt et al. 200ol . lHarhig fc Rix 2004 ). AO is increasingly dominating what was once 



the domain of optical longslit spectroscopy with HST, because the high resolution it affords 
is coupled with a large collecting area (allowing one to study faint galaxies) and integral 
field spectroscopy (the 2D coverage of which enables a more robust re covery of the orbita l 
distribution), at near-IR wavelengths (to probe dust obscured nuclei). iDavies et al. (20061 ) 
showed that it is possible to measure BH masses in type 1 active galactic nuclei (AGN) 
using spatially resolved stellar kinematics, providing a complementary method to reverber- 
ation mapping which relies on tracking the temporal variability of the broad lines. For 
NGC 3227 they argued that Mbh was lower than previous estimates, lying in the range 
(7-20) xlO 6 M Q . The most recent reverberation mass of (7.6 ± 1.7) x 10 6 M Q derived from 
a well sampled light curve l|Dennev et al. 201p| ) is consistent with this range. Nevertheless, 
uncertainties for measurin g Mbh in quiescent or active spiral galaxies are likely to remain 
high due to data quality ( Krainovic et al. 20091 ) as well as systematics and degeneracies 
associated with the distribution function of the stellar orbits, the co-exist ence of multi- 
ple stellar populatio ns, and the presence of significant nuclear gas masses (|Davies 20 08a. 



Giiltekin et al. 20091 ). 



Coupled to this is the question of pseudo-bulges. Because they are built through secular 
disk processes rather than merger events (and therefore have different stellar populations, 
mass distributions, and kinematics), it is not clear whether the black hole properties should 



correlate with them in the same way as for classical bulges (jKormendv fc Bender 2011 



Orban de Xivrv et al. 201 ll ) . It is now realised that many local disk galaxies have at least a 
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R.A. offset (arcsec) 

Figure 10: 

AO integral field observations of the type 2 AGN NGC1068 (1"= 70 pc) in the 2.12 /jm H 2 line. 
Left: the central few arcsec show an expanding ring of molecular gas with the brightest emission 
on the northeast side. The location of the AGN is marked. The square denotes the region viewed 
in more detail on the right where gas is rapidly inflowing almost directly towards the AGN. Right: 
two tongues of emission are revealed in the central arcsec (line emission (top) and projected 
velocity (bottom); the line properties could not be extracted at the location of the AGN itself). 
The trajectories of the best fitting models of inflow are shown, and follow both the emission and 
its velocity — which on the north side is approaching (blue colours), and on the south side is 
receeding (yellow colours), a ccelerating as it nears the AGN (red colours). Adapted from 
iMuller Sanchez et al. (20091) . 



pseudo-bulge component to their central regions jFisher fc Drorv 201ll , lweinzirl et al. 20091 ). 
and so this may contribute to the scatter of disk galaxies on the Mbh — o* plane. Using 
AO to derive black hole masses in such galaxies, Nowak et al. I2009I , I2010I ) suggest that 
to really understand the co-evolution of BHs and bulges one does need to tease apart the 
pseudo- and classical bu lge components. 

For elliptical galaxies, Gebhardt et al. (201lh argue that the main sources of uncertainty 
on Mbh are the treatment of the dark matter halo, an incomplete orbit library, and triax- 
iality. To overcome them for M 87, these authors combine AO integral field spectroscopy 
with wider field data to find a mass of (6.6 ± 0.4) x 10 9 M . That this exceeds the mass 
expected from the Mbh — £>"* relation by twice its uncertainty suggests that the high mass 
end of the relation is poorly constrained and/or its scatter is larger than previously thought. 



3.5.2 Gas Inflow and Outflow Nearby active galactic nuclei (AGN) provide an 
excellent opportunity to study the mechanisms that drive gas towards the central BH. 
Their proximity means that with AO, scales down to a few parsecs can be resolved. How- 
ever, it is the combination of AO with integral field spectroscopy that is leading the way 
forward, by probing the full 2D distribution and kinematics of stars, molecular gas, and 
ionised gas. Such techniques, in the optical or in the near-IR using AO, have revealed 
inward flows of gas at relatively low rates along circumnuclear spiral arms in a number of 
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galaxies JPavies et al. 20091. iFathi et al. 20061 . iRiffel et al. 20081 . ISchnorr Muller et al. 201ll . 
Storchi-BCTgmamTeT*al" 2007h . Although there can be exceptions to this, such as the dra- 
matic case of NGC 1068 where the gas appears to be streaming almost directly towards the 
AGN jMiiller Sanchez et al. 20091 . Fig. [TO)) , such inflow is in principle sustainable for Gyr 
timescales (|Davies et al. 20091 ). It is believed to be associated with the circumnuclear dust 
structures that have been mapped in many active and inactive galaxies. The implication is 
that gas streaming ought to be common, but its relation to AGN accretion is still far from 
clear. 

Part of the reason for this is that such processes bring gas to the central tens of parsecs 
where one might expect a starburst to ensue; but the role of nuclear star formation in fuelling 
AGN is still open to debate. Ste llar population syn thesis of optical I Cid Fernandes et al. 2004 . 
barzi et al. 200rj ) and infrared (jRiffel et al. 2009al ) data suggest that while some AGN ap- 
pear to be associated with recent (< 100 Myr old) starbursts, the nuclear stellar spectra of 
others are dominated by inter mediate age or ver y old po pulations. A poss ible solution to 
this puzzle was proposed by ( Davies et al. 2007 . see also Wild et al. 2Q10h . Using AO to 
probe scales below tens of parsecs, they presented evidence that the youngest starbursts are 
associated with only weak AGN accretion, and that AGN fuelling is much more efficient in 
a post-starburst after the early phases of turb ulent stellar feedback have ceased. Hydrody- 
namical simulations ( Schartmann et al. 2010h tend to support this view, and have shown 
how stellar ejecta from a post-starburst may generate a turbulent 1-pc scale gas disk consis- 
tent with those inferred from maser observations. If this gas is related to the obscuring torus, 
the simulations imply that the torus may be a dynamically evolving structure with different 
components responsib le for the various observed phenomena, including small scales struc - 



tures seen with VLTI (|Burtscher et al. 20091. boidlRaban et al. 20091 . iTristram et al. 2007h 



as well as larger scale components jllicks et al. 20091 ). The use o f AO integral field spec 



troscopy to map the locations of different age stellar populations ( Riffel et al. 20101 . 2011) 
is a new aspect that should provide important clues in the emerging picture of how nuclear 
starbursts are related to the torus and to AGN fuelling. 

Seyfert nuclei require little gas to fuel their moderate luminosities, so one might ask 
whether the far larger inflowing gas masses are piling up or being expelled. Here also integral 

field spectroscopy couple d to AO is revealing details about AGN outflows via their ionised 

and coronal line emission dMiiller Sanchez et al. 201 ll . IRiffel et al. 2009bl . IRiffel fc Storchi-Bergmann 201 ll . 



Storchi-Bergmann et al. 2010h. in a way that is highly complementary to longslit spec 
troscopy with HST ( Crenshaw et al. 201lt ) . Since the outflow rates are 100 or more times 
greater than the accretion rate onto the BH, it seems that AGN driven winds are entrain- 
ing a significant amount of gas from the local ISM. Clues about the complex interaction 
between the outflow and the ambient ISM are also appearing from models of the outflow 
geometry. These are providing hi nts that the more molecular gas there is in the vicinity, the 
faster and narrower the outflow ( Muller Sanchez et al. 20 111 ). These first results illustrate 
how AO, by enabling one to probe the detailed physics of outflows in local AGN, is a key 
technology in understanding a phenomenon that has shaped galaxy evolution. 



3.5.3 Quasars and Mergers While it is possible to probe to small scales in nearby 
Seyfert galaxies, the more luminous QSOs provide a much greater observational challenge, 
due to their distance and the o ften overwhelming b rightness of the AGN with respect to the 
host galaxy. As emphasized bv lGuvon et al. (20061 ) in their survey of 32 nearby QSOs using 
Gemini North and Subaru, the key issue is knowing the PSF so that it can be accurately 
subtracted (see Sec. [4| in order to reveal the underlying host galaxy. At high redshift, 
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one has to contend with the (1 + z) 4 surface brightness dimming and so, even with AO, 
detecting the host galaxy can itself be difficult (jCroom et al. 20041 . iFalomo et al. 20051 ). 

The specific role of mergers in fuelling QSOs can be assessed by targeting the ~ 1% 
of QSOs that exhibit double peaked [Oni] lines, which could trace pre-coalescent dual 
AGN. Remarkably, AO ima ging of such c a ndidates shows tha t only 30-4 0% have double 



nuclei on kiloparsec scales (|Fu et al. 20111 . iRosario et al. 20111 '). Instead, IFu et al. (20111 ') 



argue that the origin of the double pe aked line profiles ma y lie in other processes such as 
outflows or jet-cloud interactions; and IRosario et al. f2011i ) suggest that most QSOs may 
not be associ ated with gas rich ma jor mergers, an idea that is gaining weight from other 
observations (jCisternas et al. 201lf ). As discussed in Sec. 13.61 the role of secular processes 
in galaxy evolution at early cosmic times is now also receiving increasing attention. 

Nearby luminous merging systems provide an alternative approach to this issue. One of 
the best known dual AGN, NGC 6240, has been the focus of several AO studies. These have 
highlighted numerous massive young star clusters around the nuclei whic h are undetected 
in optical HST images ( Max et al. 2003 ; Pollack. Max fc Schneider 2007 ). Comparison of 
AO images from K-band (2.1 /xm) to L-band (3.8 am) with X-ray and radio continuum 
data have revealed the location of the two AGN (|Max et al. 20071 ). Although the AGN 
themselves are hi ghly obscured, across most of the merger the extinction is rather less 
( Engel et al. 2010l): combining AO data with HST images shows it to be consistent with the 



Calzetti et al. (2000) reddening law, derived for isolated star forming galaxies rather than 



mergers. Spatially resolved stellar kinematics afforded by AO now provid e, in addition to th e 
tidal arm morphology, independent constraints on the merger geometry ( Engel et al. 201o1 ) ; 
and have yielded the intriguing result that th e BH in the souther n nucleus also lies above 
the high mass end of the Mbh — c* relation ( Medline et al. 201lh . 



3.6 The High Redshift Universe 



One of the most remarkable applications of adaptive optics has been in spatially resolving 
the internal structure and kinematics of star forming galaxies at z ~ 1.5-3, the epoch 
of peak mass assembly. These galaxies are usually no more than 1-2" across, and the 
optical emission lines that constitute prime diagnostics of their dynamical and physical 
properties are redshifted to the near-IR. They would make ideal AO targets, except that 
most deep fields where such galaxies are identified are specifically chosen to avoid bright 
stars. At the same time the low surface brightness means the resolution is often limited 
to 0.1-0.2" by instrumental and observational constraints (e.g. seeing variability over long 
integrations). Making use of AO in this r egime is difficult. For their la rge spectroscopic 
imaging survey of 63 high redshift galaxies, iForster Schreiber et al. (20091) used AO on onl y 



12 target s (a number that h as doubled in an extension to the survey. iMancini et al. 2011 ). 
Similarlv lLaw et al. (2009b[ ) detected only 13 of the ir target s using AO. IWright et al. (20091 ) 
observed only 6 galaxies and Wisnioski et al. (2011 ) 13, and Mannucci et al. (20091 ) selected 
10. Quantitatively, the severity of the selection criteria means that even when using a laser 
guide star, only about 10% of high redshift galaxies have a suitable guide star within 30- 
40"; and when one takes into account other criteria (e.g. ensuring that the faint emission 
lines fall in regions of high atmospheric transm ission but not too close to bright OH airglo w 
lines), the fraction goes down to ~ 1% or less ( Mancini et al. 2011 . Mannucci et al. 20091 ). 

Following the kinematical confirmation with AO that massive star forming galax ies at 
z ~ 2 are not necessarily mergers but can also be gas rich disks ( Ggjrzel_^t_al^_2006 ) , it i s 
now thought that about 1/3 of such galaxies are in fact disks ( Forster Schreiber et al. 20091 ). 
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Figure 11: 

AO integral field spectroscopy is now an important tool in probing the physical structure and 
dynamical state of high redshift galaxies. These data of the z ~ 2 galaxy ZC406690 reveal a 
coherent velocity field indicative of disk rotation (left), the individual star-forming clumps 
(center), and a blue wing on the spectral line profile of one of these clumps that traces a powerful 
outflow (right). In the right panel, the blue profile tracing the star formation and the red profile 
traci ng the outflow sum to give the green profile which is a fit to the observed spectrum. Adapted 
from iGenzel et al. (20111 1 . 



As shown in Fig. [TT]for one such object, these disks are unlike those of local galaxies. They 
are rapidly forming stars, often in giant star-forming complexes or clumps. Fitting beam- 
smeared disk models to t he kinematics shows that they have high intrinsic velo city dis- 



persions of 20-100 km s" 1 (jCresci et al. 20091 . Haw et al. 2009bl . I Wright et al. 2009h . Many 



studies now suggest that such dispersions are characteristic of normal massive high-redshift 
disks, and are connected to the high gas accretion rates through cold flows at early cosmic 
times, high gas fractions, and global instability to star formation. AO is a key capabil- 
ity because it separates the ~ lkpc (0.1-0.2") sized clumps from the inter-clump regions, 
which are blurred together in seeing-limited conditions. Even higher linear resolutions, cor- 
respondi ng to as little as ~ 100 pc, can be re ached if AO is applied to gravitationally lensed 
objects ( Jones et al. 20101 . Stark et al. 20081 ). Using AO with integral field spectroscopy is 
now shedding light on the detailed properties of the individual clumps and showing that 
they can drive powe rful winds, with outflow rates that may exceed their star formation rates 
(jGenzel et al. 201lh . This has important consequences for the clump lifetimes, and hence 
the evolution of the galaxies, for example whether t hey survive long enough to migrate in- 
wards and build a nascent bulge ( Genzel et al. 200c} ) or fuel an AGN (jAmmons et al. 20091 . 
Wright et al. 20101 1. 



Through the use of AO and integral field spectroscopy, and despite the difficulties of find- 
ing suitable targets, this field is now thriving. The observations, although still relatively 
few, have already led to fundamental developments in our understanding of galaxy evolu- 
tion. They are now driving the requirements for multi-object AO systems that can operate 
simultaneously on many integral field units deployed across a wide field (see Sec. 15.3.2]) . 



4 Point Spread Function 

Knowledge of the point spread function (PSF) is of obvious importance for analyzing AO 
data sets, yet the PSF delivered by adaptive optics systems has an unfavourable reputation, 
based on its complex shape and its spatial and temporal variability. While the traditional 
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advice to cope with this has been to observe a PSF reference star and use it to deconvolve 
the data, there are in fact a variety of options available to both measure and compensate for 
the PSF. Despite this, the limited applicability of all these methods means that the issue of 
recovering the AO PSF remains an unsolved problem. As such, our aim in this section is to 
set the scene for future developments that might assist in quantifying spatial and spectral 
PSF variability, and ultimately in motivating efforts towards PSF reconstruction. 

We begin by discussing how the PSF is used to extract the desired information from the 
data. Despite the variety of scientific analyses described in Sec. [3] the PSF is typically used 
in only a few different ways. We outline the aims of these methods, and look at the accuracy 
with which they require one to measure the PSF. In the second part of the section, we turn 
to how one might achieve this. There are a number of methods to empirically measure the 
PSF, either with additional observations or from the science data itself, but none of them 
are globally applicable. The alternative of reconstructing the PSF from wavefront data 
would be preferred by astronomers, but implementing it robustly is still a major challenge. 



4.1 The Role of the PSF 

The main methods of PSF-dependent post-processing can be broadly grouped into four 
classes: (i) deconvolution, (ii) model convolution, (iii) point source photometry and as- 
trometry, or (iv) speckle suppression. These regimes, which are discussed in the following 
sections, disentangle the scientific information from the effects of the PSF in very different 
ways depending on the aim of the analysis. 

4.1.1 Deconvolution The intensity distribution across an image / corresponds to the 
convolution of the true intensity distribution of an object O with the PSF P together with 
some additive noise N: I = P&O + N . The aim of deconvolution is to infer the unknown O. 
During post-processing, it is commonly applied in solar astronomy, asteroid studies, and 
imaging of planetary surfaces and atmospheres, targets which all contain structure that 
does not easily lend itself to characterisation w ith a simple analy tical model. Generally, 



deconvolution does not lead to super-resolution ( Starck et al. 2002 ). Instead, the aim is to 
enhance low contrast features in the data, in effect by removing the characteristic broad 
AO PSF halo that dilutes them. Such techniques are particularly important for generating 
dopplergrams (i.e. difference images of the red and blue wings of an emission line) of solar 
features, which are sensitive to va riations in AO performance and hence PSF core/halo 
c ontrast jRimmele fc Marino 2011 ') 



Starck et al. (20021 ) describe a number of methods to perform deconvolution based on 



different mathematical approaches, each of which is suited to specific circumstances, for 
example whether the astronomical object has well defined edges (e.g. planets, asteroids) 
or not (e.g. galaxies). However, there are many issues that must be carefully controlled. 
Chief among these is noise amplification, to which deconvolution, as an inverse process, is 
inherently prone. Another key issue is the limb brightness of planetary surfaces, which can 
be artifically enhan ced due to Gibbs oscillations resulting from a high luminosity gradient 
( Hirtzig et al. 200rJ ). Such problems can be reduced by explicitly inclu ding terms in the 



algorithm to preserve the edge properties, as was done in the AIDA code (|Hom et al. 2 007) 
which was developed for processing asteroid im ages (Fig |12[l . Its implementation is a specific 
extension of MISTRAL (jMugnier et al. 2004h . a myopic restoration tool. The strength of 



myopic deconvolution is that it does not assume the PSF is perfectly known. Instead, it 
estimates the PSF and object simultaneously, using a number of PSF reference images in 
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Figure 12: 

Left: Keck II AO images, taken 2 hours apart, of the asteroid 9 Metis in 2 orientations. The 
deconvolution with AIDA has enhanced the contrast of the surface features. Right: the features 
across the deconvolved images match the shape inferred from the independently derived lightcurve 
inversion model at the appropriate orientati on, and hence remove the pole ambiguity from that 
model. Adapted from lMarchis et al. (2006bf) (courtesy of F. Marchis). 



addition to the object frames, and applying soft constraints to its properties. By using priors 
in this way, it is more robust than blind convolution, which makes no assumptions about the 
PSF. On the other hand, blind deconvolution is applicable to solar images where no image of 
a PSF can be taken. Instead, prior constraints are provided by a careful optical design that 
yields simultaneous imaging of 2 or more narrow bands, together with defocussed data that 
enable one to make use of phase diversity methods. Using such data it is possible co rrect 



high order aberrations post-facto (|van Noort. Rouppe van der Voort fc Lofdahl 20051 1 



4.1.2 Model Convolution The requirement to disentangle object data from the PSF 
does not necessarily mean that a classical deconvolution must be performed. If the target 
can be reasonably described by an analytic expression, then this can be convolved with the 
PSF and the result compared directly to the observed data. Thus, given / = P g> O + N , 
one defines a model M that provides an estimate of the observed intensity distribution 
I = P ® M. Iterative improvements to the model M lead to J ~ /, at which point 
it is reasonable to assume that M ~ O. Because typically only a few parameters are 
needed to specify a model, the problem is highly constrained and does not lead to noise 
amplification inherent in deconvolution. Perhaps the most important advantage of this 
method is that it deconstructs the intrinsic source properties so that the final product is 
a set of parameters (rather than another image) which can directly provide insights into 
the source itself. The methodology also enables one to make reliable estimates of the 
uncertainties of the parameters, because it is straightforward to quantify the impact of 
changing them. Since it is often the inevitable simplicity of the model rather than the PSF 
that limits how well the convolved model matches the observed data, one does not require 
a highly detailed measurement of the PSF. It can be sufficient just to know the strength 
and extent of the wings, which enables one to represent the core and halo of the PSF in a 
simple way as a pair of bivariate Gaussian or Moffat functions. 



Adaptive Optics for Astronomy 27 



Being well suited to data with low signal-to-noise, the method is frequently applied to deep 
images of distant galaxies, the light intensity profiles of which are commonly parametrized 
using a Sersic function. But while fitting a simple function to a galaxy profile is the strength 
of model convolution, it is also a weakness: one only knows how well that particular family of 
models matches the data, and it is not always clear whether (or how) the fit may be biassed 
by complex structure that is not accounted for properly. It is to overcome this limitation 
that perturbations in the form of Fourier modes have been included to the azimuthal shapes 
in the popular galaxy profile fitting code GALFIT3 ( Peng et al. 201oT ). 



Kinematics are a special case where deconvolution is not an option because of the inter- 
dependency imposed by the PSF between the observed luminosity, velocity, and dispersion 
(velocity width). In this case, the only alternative to model convolution is to deconvolve 
each spectral plane of the original datacube, noting that the PSF that varies gradually 
from one plane to the next. Early work by iFerruit. Pecontal fc Wilson f2000l ) used an 



independent higher resolution image to guide such a deconvolution, leading to a factor 3 
gain in spatial resolution for the kinematics of the [O III] outflow in Mkn573. More recent 
efforts have focussed on the specific conte xt of extracting the s pectrum of a supernova (i.e. 



point source) superimposed on a galaxy (jBongard et al. 20111 ). but the method has more 
general applications. 

4.1.3 Point Source Photometry and Astrometry The information contained in 
images of stellar fields (from binary stars to entire star clusters) comprises the flux and 
position of each source. In densely populated fields, because the sensitivity is limited 
by the crowding rather than photon and detector noise, the key to extracting accurate 
photometry and astrometry is in obtaining a good estimate of the PSF with which to fit 
the sources. In addition it is important to quantify the flux level at which one can no 
longer distinguish between faint sources and residual speckles or other noise features, since 
this directly relates to the spatially variable completeness correction that must be applied 
to the derived number counts. Various codes have be en developed ex plicitly for processin g 



stellar fields, most notably DAOPHOT (jstetson 19871 ) and StarFinder (jPiolaiti et al. 20001 ) 



Within DAOPHOT, the reference PSF is based on analytical fits to several stars (with a 
look-up table for small empirical corrections), and so is able to handle undersampled data 
as well as accommodate some degree of spatial variation. On the other hand, StarFinder 
was designed to work on AO data with complex PSF shapes, and so uses an empirical PSF 
that is extracted from a combination of various stars in the field. Although it currently does 
not account for anisoplanatism, a local PSF can be extracted for each distinct isoplanatic 
region. Crucially, even though the core of the PSF may be very narrow, with a FWHM 
of ~ 50 mas, high photometric precision requires that the reference PSF is determined out 
to at least 1" so that the faint wings are properly characterised. It is because of this that 
Schodel (2O10l ) argues that, if one is careful, the process can be improved by first applying a 



linear (e.g. Wiener) deconvolution, which reduces the strength of the PSF wings and makes 
the core brighter and narrower. 

4.1.4 Speckle Suppression High contrast imaging takes the requirements on PSF 
knowledge to extremes, and has led to the instrumental and observational requirement that 
the complex speckle pattern in the wings of the PSF cancels out when data are combined. 
This requires very detailed characterisation of the PSF, which can be done under specific 
conditions. Here we outline several ways that have been developed to achieve it: 
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Chromatic methods include simultaneous differential imaging (SDI) and spectral decon- 
volution (SD). SDI makes use of the different spectral energy distributions of the star and 
its companion. One can then define two narrow bands close to each other in wavelength 
so that in a difference image, the speckles will subtract out but the companion does not 
(e.g. because methane absorption longwards of 1.62 in the cool atmosphere of a late 
type brown dwarf m akes it almost disappear in an image taken at this wavelength). SD 



( Sparks fc Ford 20021 ) makes use of the fact that the radial location of the speckles from 
the centre of the PSF is proportional to wavelength, while the location of a companion with 
respect to the star is fixed. Re-scaling all individual images of an IFS cube proportional to 
its wavelength aligns the speckles but makes the planet move inwards with increasing wave- 
length. Speckles are now well fitted by a smooth (e.g. a low-order polynomial) function to 
each pixel along the cube's dispersion axis while the planet produces a narrow bump while 
traveling through the pixel at a certain wavelength range. Since this bump is badly fitted 
by the smooth function, the subtraction of the fit removes most of the speckles and leaves 
the planet. This latter method has been u sed to reach 9 mag contrast at 0.2" separation 



without a coronagraph (|Thatte et al. 20071 ) 



Polarimetric differential imaging (PDI) is based on the fact that at small angular radii, 
speckles remain unpolarised. Thus, if two images at orthogonal polarisations are taken 
simultaneously, the speckles will subtract out. Only the polarised light is left, which might 
trace, for example, scattered light from a circumstellar disk or even the atmosphere of an 
exoplanet. The potential of this technique was first demonstrated on the T Tauri star 
TW Hya (|Apai et al. 2004h . and a highly sensitive differential polarim eter will be used for 



the VLT exoplanet imaging instrument SPHERE i Beuzit et al. 20061 ) 



Temporal metho ds exploit rapid variations in the speck le pattern. The 'dark speckle' 



method proposed bv lLabevrie (19951 ) and demonstrated by iBoccaletti et al. (200 11 ) makes 
use of the fact that the rapidly changing speckle pattern will never reach its minimum 
level (which is dependent on the static long-exposure pattern) in locations where there 
is emission from a faint companion. However, while this technique enables one to detect 
faint companions, it excludes performing photometry. A parallel method also based on 
using statistical techniques to distinguish speckles from comp anions, but enabling one to 



estimate the intensity of the companion, has been proposed bv lGladvsz et al. (20ld ) 



Spatial methods such as angular differential imaging (ADI) rely on the fact that the 
major sources of quasi-static speckles arise within the telescope and instrument. ADI can 
be used if the field of view, and hence the image of the exoplanet, is allowed to rotate 
with respect to the telescope/instrument configuration and henc e the quasi-static speckl e 



pattern. With suitable processing of such a sequence of images (e.g. lLafreniere et al. 2007bl ). 
one can cancel the speckles while enhancing the signal from a companion. ADI is rather 
simple to implement on alt-azimuth telesco pes, because field an d pupil rotate with respect 
to each other. It was first demonstrated bv lMarois et al. (20061 ) and is now widely used for 
high-contrast imaging. 

Deep Suppression of speckles over a limited part of the PSF is possible through phase 
correction by the DM. The Fraunhofer diffraction formula states that the complex amplitude 
A(x)e l ^ x ' of light is essentially propagated between aperture and image plane by a Fourier 
transform. Absorbing the amplitude into a complex phase 4>c( x ) and considering small errors 
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e x ~ 1 + ai, we see that sinusoidal phase or amplitude errors across the aperture produce two 
speckles of light that are symmetric to the image center FT(sin(ax)) oc S(f — a) — S(f + a). 
The angular separation of these speckles from the center grows with spatial frequency of the 
ripple. Since the DM has a spatial frequency cut-off due to its finite number of actuators, 
it can only produce or correct speckles up to a maximum angular separation 8ao called the 
correction radius. In order to suppress a coherent patch of light created by phase and/or 
amplitude aberrations inside the correction radius, the DM just needs to create an anti- 
speckle of the same amplitude but with a 7r phase-shift at the same location. Obviously 
this scheme would work perfectly for pure phase aberrations in the aperture since the DM 
would simply have to flatten the phase. Unfortunately, the phases of the symmetric speckles 
created by a phase sinusoid cos(fcx + 8), e.g. created by the DM, are 7r/2 — 8 and n/2 + 8 , 
while the phases of speckles created by an amplitude sinusoid are 8 and —8 (jGuvon 20051 1. 
Hence, only one of the amplitude speckles can be suppressed by an aperture plane DM 
at the expense of an amplification of the other one. Exploiting the Talbot effect, i.e. the 
transformation of a phase sinusoid into an amplitude sinusoid by Fresnel propagation, it is 
however possible to correct all image plane speckles over a certain wavelength range using 
a second DM. Algorithms that describe in more detail how to su ppress speckles and c reate 
a dark hole with a single DM were presented by, for example, iGive'on et al. (20071 ) and 
Guvon et al. (20101 ); and a suppression of the diffracted and scattered light near a star-like 



source to a level of 6 x 10~ 10 at 4 A/D using such methods has been demonstrated in the 
laboratory (jlrauger fc Traub 20071 ). 



4.2 Estimating the PSF 

We have seen that there are various ways in which a PSF might be used for a scientific 
analysis. Similarly, there are a variety of options for estimating the PSF. However, each 
of them has limited applicability, and so we consider also PSF reconstruction based on the 
wavefront sensor (and ancilliary) data, and discuss whether this is a viable alternative. 

4.2.1 Empirical Extraction The four methods described here illustrate that there 
are often circumstances which make it possible to derive at least some information about the 
PSF from observations. However, they also emphasize that this is usually at best a crude 
approximation, and that independent guidance and confirmation about the PSF shape is 
much needed. 

Targeted Reference Star Observers are often recommended to take images of an iso- 
lated reference star in order to estimate the PSF. This can work well if a number of implicit 
assumptions are fulfilled: (i) the atmospheric conditions are sufficiently stable that a short 
and temporally distinct exposure on the PSF star is a good representation of the PSF in a 
long science exposure (a seeing monitor can help to verify whether this condition is met); 
(ii) the intensity and distribution of flux on the WFS is the same for both the PSF refer- 
ence and the science target's guide star (which is not necessarily the case for non-stellar 
guide stars such as galaxy nuclei, which may be extended and/or superimposed on a bright 
background); (iii) if an off-axis guide star is used for the science target, the anisoplanatism 
for the PSF reference should be the same (i.e. one may need a pair of stars at a specific 
separation, one matching the guide star magnitude and the other for the science camera). 
Clearly, there are situations where taking targeted images of a reference star can be suc- 
cessful, particularly if several such images are used for myopic deconvolution as described in 
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Sec. 14.1.11 However, even if one accepts the reduced observing efficiency such observations 
imply, it is often hard - or even impossible - to meet all 3 conditions. 



Unresolved Sources in the Science Field For observations of stellar fields, we have 
seen that the standard procedure is to extract the PSF using several stars in the field. This 
approach can also be applied to other science targets if there are unresolved sources in the 
vicinity. More often, such sources will be distributed over a wide field and so it will be 
necessary to account for anisoplanatism. Methods have been developed to estimate an off- 
axis PSF by making use of knowledge of the C%(z) distribution through the atmosphere; and 
in principle these could be turned around to derive an on-axis PSF from off-axis stars. As an 
alternative, there has been some effort to show one can use calibration frames containing 



many stars to quanti f y emp irically how the PSF varies across a field (|Cresci et al. 2005 



Steinbring et al. 20021 . I2005T ). Applying this to a science field where there are at least a few 
stars then enables one to interpolate the PSF at any point in the field. The approximate 
nature of the inferred PSF makes this method suitable for model convolution. However, it 
is impractical for spectroscopy because the field of view is simply too small. 

Within the Science Target Rather than extract the PSF from stars in the field or 
from observations of other stars, any feature in the science target that is spatially unresolved 
can be used to estimate the PSF. Chromatic, polarimetric, and angular differential imag- 
ing (Sec. I4.1.4[) are special cases where the instrumentation and observing technique are 
designed to provide a PSF reference image. Similarly, there are a number of different com- 
ponents contributing to the total spatial and spectroscopic emission from AGN. In Seyfert 
nuclei, the broad line region is only a few lightdays across and is always unresolved (in 
QSOs, its size can be measured in lightyears but, because of their greater distance, is still 
unresolved). Alternatively, the near-IR non-stellar continuum, which usually arises from 
hot dust close around the AGN, is only 0.1-1 pc across and hence unresolved on 8-10-m 
class telescopes at distances of ~ 20 Mpc or more. The spatial distribution of both these 
quantities can be extracted using the spectral information in a datacube. This method of 
deriving the PSF can be successful at yielding its overall shape without fine detail, and 
so is suited to spectroscopic studies where one uses a model to derive the intrinsic source 
properties; but it is applicable to only a limited number of the science targets. 

Comparison to Higher Resolution Data In some circumstances it may be possible 
to make use of a higher resolution broad-band image to infer the PSF (for example when 
analysing kinematics from undersampled integral field spectroscopy). The rationale is that 
the observed intensity distribution I is the convolution of the intrinsic object O with a PSF 
P so that / = P®0. One can find a broadening function F which, when convolved with the 
higher resolution observation J;,, reproduces the lower resolution observation 7) = F ® Ih- 
The lower resolution PSF is then Pi — F ® Ph. While this can be successful, the point of 
AO is to yield the highest resolution data, making this a niche method. 

4.2.2 Reconstruction None of the empirical approaches described above is applica- 
ble under a wide range of observation conditions and ast ronomical targets. An alternative 



method was therefore introduced by IVeran et al. (19971 ) and successfully applied to the 



low-order curvature AO system PUEO. This method models the optical transfer function 
(OTF, the Fourier transform of the PSF) as the product of three contributors, the OTFs of 
i) telescope and instrument, ii) high spatial frequency turbulence that cannot be corrected 
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by the DM, and iii) the low spatial frequency AO corrected residual wavefront. While the 
contribution of telescope and instrument is supposed to be quasi-static and can be mea- 
sured, and the uncorrected turbulence is supposed to follow Kolmogorov statistics described 
solely by the Fried parameter ro, the determination of the AO residual wavefront requires a 
deep understanding of the AO control dynamics and WFS characteristics as well as a priori 
modeling assumptions on atmospheric turbulence and noise characteristics. 

The expected difficulties with making PSF reconstruction work in more complex cases 
(low signal-to-noise, off-axis guide star, or when using a LGS), made Jolissaint et al. (2010h 
speculate about alternative ways to retrieve a reasonable PSF estimate that could still be 
useful for astronomical applications. One approach could be to derive the PSF using an- 
alytical models which are fed with some basic parameters (turbulence profile, AO system 
parameters, guide star parameters) and heuristically tuning it to the actual system PSF. 
Another approach to reconstruct PSFs would be to make use of point sources in the field 
an d estimate the PSFs lying in between either by a generalization of the method presented 
by iBritton (2006) to derive the anisoplanatic degradation, or by interpolating the few pa- 
rameters of an empirically determined analytical PSF profile (e.g. Moffat distribution). 
Certainly, formulation of a proper metric and required accuracy for PSF reconstruction, 
widely accepted in the astronomical community, is a necessary prerequisite for the devel- 
opment of an a ppropriate algorithm. The new work on PSF reconstruction started by 



Jolissaint (20111 ) may provide a basis from which a suitable metric can be developed. 



5 Novel Techniques 

The science described in Sec. [3] was performed using SCAO systems. These have only a 
single deformable mirror to correct the wavefront perturbations, which are measured using 
a single reference source (or with a laser guide star, a single source for each of the high order 
and tip-tilt components). The scientific applications of such systems are constrained by the 
limited performance, particularly at shorter wavelengths, the small field of view, and poor 
sky coverage (see Sec. [2]). In this section we discuss the motivation for novel alternative 
schemes for wavefront sensing and correction that promise to widen the parameter space of 
adaptive optics. We focus on concepts that have already been implemented and tested on 
sky, or are currently being built. 



5.1 Laser Tomography AO 



We have seen in Sec. 12.41 that while using a LGS improves the sky coverage with respect 
to NGS, there are still limitations, notably the cone effect. For a sodium LGS on an 8- 
10-m class telescope, the cone effect reduces the Strehl ratio obtainable by a factor 0.85 
in K-band and 0.6 in J-band. This moderate impact would become devastating for the 
next generation ELTs, limiting the maximum achievable K-band Strehl to only 15%, and 
much less at shorter wavelengths. In order to overcome the cone effect, measurements 
from several LGSs can be combined to fully reconstruct the turbulence column in the 
direction of the astronomical target (lower left panel in Fig. If 30 . The separation of the 
LGSs can be significantly larger than the isoplanatic angle 8o, but their beams should still 
overlap at the high est turbulent layer in order to prevent there being unsampled turbulence 
( Fusco et al. 20foh . The reconstructed volume is then collapsed along the third dimension 
and finally projected onto a single DM. This technique is called laser tomography AO 
(LTAO) and has been tested in the laboratory ( Costille et al. 201ol ) . 
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Ground Layer Adaptive Optics Mufti- Conjugate Adoptive Optics 




Loser Tomography Adaptive Optics Muiti -Object Adaptive Optics 




Figure 13: 

Conceptual drawings of various novel AO concepts (courtesy of E. Marchetti, ESO). Clockwise 
from top left: Ground Layer AO, Multi-Conjugate AO, Multi-Object AO, and Laser Tomography. 



By involving a tomographic wavefront reconstruction, LTAO provides both a scientific 
and a technical stepping stone towards the (more complex) capabilities described later. It 
also permits a degree of flexibility in how the wavefront correction is done. If the science 
requires a high Strehl on a single target, the wavefront correction can fully overcome the 
cone effect. Calculations for ATLAS on the European ELT suggest that, optimised on-axis, 
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one can reach about 50% Strehl in the K-band (|Fusco et al. 2010h . For extended sources, 
one can adjust the weighting of the wavefront errors as they are projected onto the DM to 
provide a more uniform, but more moderate, performance over a larger field. In the most 
extreme case, one can achieve a modest performance of 10-15% Strehl over a field as wide 
as 120". At the same time, the area over which one can search for a NGS to compensate 
the lack of tip-tilt signal from each LGS is widened, improving sky coverage. 



5.2 Seeing Enhancement 

In the preceeding sections we have seen that sometimes, due to the limitations of objects 
(e.g. surface brightness or size) and instrumentation (e.g. affordable detector area), one 
must perform a trade-off between signal-to-noise, resolution, and field of view. It means 
that there are many situations where one cannot make full use of the diffraction limit, but 
can still benefit from enhanced resolution. On the other hand, if the tip-tilt star is very 
faint, one may find there is significant residual jitte r, augmented by ani soplanatic effects 



from the unavoidable offset to the science target. Davies et al. (2008bl ) suggested that, 
extrapolating this regime to the limit, it is conceptually only a small step to dispense with 
tip-tilt completely when using a laser guide star; and that by doing so, one can achieve 
spatial resolutions down to 0.2" with 100% sky coverage. Such a mode of operation has 
been commissioned at the VLT, where the fast tracking at 30 Hz frame rate performed 
by the telescope itself compensates for vibrations such as wind shake, and is effective for 
observations of single sources. 

5.2.1 Ground Layer Correction Enhanced resolution over a wide field - in princi- 
ple up to ~ 10' - would bring many advantages. An obvious application is to stellar fields: 
one can perform the analysis simultaneously over the entire field, rather than splitting it 
into isoplanatic pieces. This has two immediate benefits: the completeness correction is no 
longer dependent on distance from the guide star, and it increases the potential for deriving 
a better PSF reference. Even if the science target is a single source, uniform correction over 
a wide field vastly improves the ability to empirically measure the PSF. 

Multi-object spectroscopy would also benefit enormously from such a capability, especially 
for extragalactic studies. As we have seen in Sec. 13.61 the morphologies, kinematics, star 
formation rates, and metallicities of high redshift galaxies provide crucial insights into 
galaxy formation and evolution. Key issues still to be resolved include the respective roles 
of mergers and secular evolution in galaxy growth; how and when the various elements of 
the Hubble sequence arose; how the galaxies acquired their angular momentum; and the 
roles played by AGN and stellar feedback in regulating BH and galaxy growth. These all 
require deep, spatially resolved spectroscopy of faint sources with sizes of 0.1-1". On 8-10- 
m class telescopes, seeing limited conditions do not provide the necessary spatial resolution, 
while at the diffraction limit signal-to-noise is a major constraint due to the faint surface 
brightness of the spatially extended regions. Intermediate resolution offers a reasonable 
compromise, and having it over a wide field adds a very important multiplex advantage - 
a combination that is likely to be competitive also for the ELTs. 

Such gains can be achieved using ground layer adaptive optics (GLAO, upper left panel 
in Fig. I13[). where only the atmospheric turbulence close to the ground is corrected. The 
key issue is that light rays from all objects, regardless of their angular offset from the field 
centre, pass through the same low layer of turbulence because it lies directly in front of the 
telescope mirror. Correcting it, by combining the wavefronts from several well separated 
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reference sources so that signal from the ground layer is strengthened while the differing 
contributions from high altitudes cancel out, therefore enhances the resolution over a wide 
field. An intriguing al ternative is use a single low altitude Rayleigh LGS, as demonstrated 
by the GLAS system ( Benn 20081 ). Here the cone ef fect is beneficial because high altitude 
turbulence must not enter the GLAO measurement (|Tokovinin 2004T ). 

Since at most astronomical sites, much of the turbulence does exist clos e to the ground, the 
GLAO gain can be s ubst antial (perhaps m ost dramatically in Antarctica; Travouillon et al. 20091 ). 



Rigaut et al. (20021 ) and Tokovinin (2004 ) have shown there is a trade-off between field of 



view and performance, which is roughly independent of telescope size. The height H (m) 
to which turbulence can be fully compensated depends on the projected actuator spacing 
d (m), which is matched to ro, and the field of view <j)o (arcsec) as H ~ 2 x lO 5 d/0o. Thus 
for wider fields of view, only turbulence closer to the ground can be corrected, limiting the 
resolution gain. Detailed simulations for the ARGOS system, which is being installed on 
the LBT, suggest that GLAO should yield a factor 2- 3 improvement in F WHM across a 
240" field of view, with a median resolution of 0.35" ( Rabien et al. 20101 ). This level of 
performance has already been validated on the MMT, where the first system to average the 
wavefront from multiple laser guide stars yielded a gain i n K-band resolut ion from 0.70" to 
0.33" uniformly across a 120" field |Baranec et al. 20091 . Irlart et al. 2o"lcl ). One important 
aspect of GLAO is that it can function even in poorer atmospheric conditions where high 
order AO struggles. It is this that will make it such an important scientific tool in the 
future - especially for the ELTs - making poor seeing useful and good seeing excellent. 



5.3 Wide Fields at the Diffraction Limit 

The scientific motivation for reaching the diffraction limit across a wide field is the same as 
for GLAO but, for 8-10-m class telescopes, with an additional factor 4-6 improvement in 
resolution and point source sensitivity. Here we describe the rationale for two contrasting 
methods that achieve this. The first provides diffraction limited performance across a 
contiguous field, the second at a number of selected points within a large patrol field. 

5.3.1 Multi-Conjugate AO In solar astronomy, the short wavelengths and day- 
time observations yield a usable field of view with SCAO of order only 10", which is 
not enough to encompass most sunspots or active regions, nor to efficiently monitor flare 
trigger mechanisms. This then is a strong motivation to develop wide field AO cor- 
rection. It can be achieved using multiple reference sources in different directions so 
that the columns of atmospheric turbulence they probe overlap partially at high alti- 
tude and fully at low altitude. In principle it is then possible to infer the full 3D struc- 
ture of the turbulence. In practice one attributes the measured perturbations to 2 or 3 
layers at specific heights and uses separate deformable mirrors, one conjugated to each 
layer, to correct them - hence multi-conjugate adaptive op tics ( MCAO, upper right panel 



in Fig. [13]). MCAO was first discussed by iDicke (19751 ) and Beckers (19881 ): and the 
first systems, de veloped in the context of solar astrono my, were tested at the Vacuum 
Tower Telescope jBerkefeld. Soltau fc von der Liihe 20oi ) and at the Dunn Solar Telescope 



( Langlois et al. 20041 ). increasing the diameter of the corrected field to as much as 60" 



Planetary, galactic, and extragala ctic science had to wai t until the MCAO demonstrator 
MAD was commissioned at the VLT ( Marchetti et al. 20081 ). The system used natural guide 



stars to deliver near-IR resolution down to 0.1" across a 120" field. Despite the limited sky 
coverage, numerous results have been published. One that typifies the potential of MCAO is 
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Figure 14: 



Left: K-band image of Terzan5 taken with MAD, which provided 0.1" resolution over the entire 
60" field. Right: Colour magnitude diagram of the cluster, with the two horizontal branch 
populations indicated. At to p left, the arrow ind icates the impact of reddening; uncertainties are 
indicated to the right. From lFerraro et al. (2009T ) (courtesy of F. Ferraro). 



discovery of two stellar populations in the star cluster Terzan5 ( Ferraro et al. 20091 ). These 
are clearly separated in the K, J—K colour magnitude diagram (Fig |14l) because the near-IR 
data are only mildly affected by the differential extinction. Both populations contain several 
hundred stars but the brighter horizontal branch clump is redder, more compact, more metal 
rich ([Fe/H] ~ 0.3), and younger (~ 6Gyr) than the fainter clump (iron abundance [Fe/H] 
~ —0.2 and age ~ 12Gyr). The authors argued that these characteristics are what one 
might expect if Terzan5 was the remnant of a dwarf galaxy that had been disrupted as 
it merged with the Galactic bulge; with the implication that galactic spheroids may have 
been assembled hierarchically from pre-formed, evolved stellar systems. 

MAD tested two different ways to achieve the wide field AO correction. For classical 
'star oriented' MCAO, the wavefront signals from several guide stars (laser or natural) 
are measured on separate detectors, and combined afterwards during the computation of 
how mu ch turbulence ori ginates from the different layers. An alternative 'layer oriented' 
concept i Ragazzoni 200ol ) measures each layer separately by optically combining the guide 
star signals for that layer onto a single detector. Each layer is then corrected independently, 
making the approach computationally simpler. In both cases, the quality and uniformity 
of the correction depend on the brightness and locations of the guide stars - which can be 
controlled far better using laser guide stars. The first laser MCAO system will operate in 
a star oriented mode using 5 LGS and typicall y 3 tip-tilt stars: GeMS is currently being 



commissioned on the Gemini South tel escope (Neichel fc Rigaut 201 if ). A layer oriented 



NGS system is being built for the LBT (jHerbst et al. 20081 ). 

An additional advantage of MCAO is that the sky coverage is higher than for SCAO, but 
this can be improved still further by making use of 'image sharpening' at near-IR wave- 
lengths for the tip-tilt stars. The design of the Keck Next Genera tion AO system include s 
DMs specifically to correct the wavefront from the tip-tilt stars ( Wizinowich et al. 20ip| ). 
For future ELTs, tip-tilt stars can be selected within the wide region beyond the science 
field where there is s till a substantial co r rection, leading to a sky coverage exceeding 50% 
at the Galactic Pole (jPiolaiti et al. 2O10l . Ilierriot et al. 2O10l ). 
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Technically the performance of an MCAO system is limited by a number of practical 
considerations. These include the finite number of guide stars, and the fact that the insen- 
sitivity of LGS to tip-tilt propagates into the quadratic modes for MCAO so that several 
NGS are required. From a scientific perspective, arguably the most imp ortant issue is the 



field of view that can be corrected. Rigaut. Ellerbroek fc Flicker (20001 ) have shown that 



this is limited by the number Ndm of DMs in the AO system to cf>o ~ 10 6 dNont /H where 
H (m) is the height to which the turbulence is corrected, d (m) is the actuator spacing 
(matched to ro) and <f>o (arcsec) is the field of view. For example (and part of the rationale 
behind the design of GeMS), if the turbulence extends up to 12 km, the condition above 
means that one needs 3 DMs to achieve a K-band Strehl of 80% uniformly across a 60" 
field. That MAD was able to provide 25% Strehl over a 120" fi eld with only 2 DMs is 
related to the distribution of turbulence through the atmosphere. iMarchetti et al. (2008) 



showed that on these occasions more than half of the turbulence was in the ground layer 
at altitudes < 500 m. To achieve high Strehl ratios over fields wider than this, MCAO is 
impractical. 

5.3.2 Multi-Object AO Surveys of high redshift galaxies require multi-object (inte- 
gral field) spectroscopy to probe the evolution of their star formation histories, metallicities, 
and internal dynamics. The number density of galaxies for which this can be done is at 
most a few per square arcmin, so patrol fields of tens of square arcminutes are required 
to make such programmes observationally efficient. In this respect, MCAO is of limited 
use: the corrected field is too small, and neither the instrumentation nor the science aims 
make use of the contiguous field. Instead, multi-object adaptive optics (MOAO) has been 
developed with the purpose of targeting specific objects within a large patrol field. 

This is achieved by having a number of wavefront reference sources in and around the 
patrol field which are used to reconstruct the 3D structure of the turbulence within it. The 
interpolated wavefront towards each science target is then corrected in open loop using 
a deformable mirror that has been allocated to that target (lower right panel in Fig. 1131 
see also Sec. I2.4[) . To validate this unusual approach, several MOAO developments are 
currently unde r way. These have led to on-sky tests of open loop adaptive optics correction 
in the I-band ( Andersen et al. 20081'). as the first step to an MOAO system that will be 



commissioned on Subaru ( Conan et al. 2010l ) ; as well as of a true MOAO correction in the 



H-band using several guide stars in the field (jGendron et al. 201ll ) 



5.4 Extreme AO 

At the opposite end of the AO parameter space is the aim to provide exceptionally high 
performance on bright (< 10 mag) stars. Reaching a Strehl ratio in excess of 90% in the ri- 
band is just the first step in a chain of comp lementary optical and data proc essing techniques 



optimised to probe deep within the PSF ( Qppenheimer fc Hinklev 2009h . but one that is 



absolutely necessary for the rest to work well. The primary scientific goal of this effort is to 
study young and evolved giant exoplanets orbit ing nearby stars in wid e 1-100 AU orbits that 



are inaccessible to current Doppler techniques (jMacintosh et al. 20071 ) . This can be done by 
direct imaging of the circumstellar environment, at a contrast ratio of 10 5 within 0.1" (and to 
10 9 at 0.04" with ELTs), and several orders of magnitude more at 1". Targeting the nearest 
stars will probe the smallest orbits, and hence provide the opportunity to detect planets 
via reflected stellar light. In contrast, planets around young (0.1-1 Gyr) stars - particularly 
stars within nearby young associations - will be self-luminous and can be detected to low 
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masses through their own emission. Focussing on known planets with systematic residuals 
in their radial velocity curves will provide the best chance of detecting additional planets 
in wider orbits. Combining these approaches will lead to a clearer understanding of the 
regime in which planets can form via core accretion, the role of migration in the evolution 
of planetary systems, and whether our solar system, with giant planets at 5-10 AU, is 
unique. 

While the AO concept is no different from conventional SCAO (see Fig. [3]), the technical 
implementation is highly challenging. The wavefront sensing must be performed with a 
small ~ 20 cm aperture spacing to spatially sample the turbulence. This requires a large 
format detector, which must be able to read out the measurements with a low < 1 e~ 
readnoise and a fast > 1 kHz frame rate in order to sample the turbulence temporally. The 
deformable mirror must match the WFS, and have a high stroke so that both atmospheric 
and instrumental perturbations can be fully corrected (as noted in Sec. 12.2.31 one way to 
achieve this is to use 2 DMs corresponding to a woofer and tweeter). Image and pupil 
instability due to thermo-mechanical effects, and non-common path errors must be actively 
minimised using additional internal control loops. On top of these provisions, highly efficient 
reconstruction and filtering algorithms and needed to reduce to < 1 ms the delay between 
measuring and applying a wavefront correction. 

The challenges and rewards of high contrast imaging have led to a mas sive effort in the de- 
velopment of extreme AO systems. The 672 actuator system at the LBT ( Esposito et al. 2010h 
could be considered a forerunner to extreme AO. The first truly high order system, with 
more than 3000 actuators on the DM, is PAL M-3000 for the 5-m te lescope at Palomar Obser- 
vatory, which had its first light in J une 2011 (Dekanv et al. 20111 '). Two dedicated extrem e 
AO instruments will be SPHERE jBeuzit et al. 20061 ) and GPI ^Macintosh et al. 20081 ). 
which are close to commissioning at the VLT and Gem ini observatories. And the AO sys- 
tem on Subaru will soon receive the SCExAO upgrade ( Martinache fc Guvon 20091 ). which 
includes an additional higher order (> 1000 actuator) DM as well as coronographs to better 
serve its high contrast instrumentation. 



5.5 Towards the Visible 



The high performance of extreme AO systems will lead to improvements at visible wave- 
lengths, but only in the few arcsec around relatively bright stars. Currently, the only way 
to achieve Strehl ratios as high as 20% at 700 nm over a wider field using fainter stars 
is through a combinati on of low order AO and image selection, as dem onstrated by the 
LuckyCam experiment ( Law. Mackav fc Baldwin 20061 ; Law et al. 2009a ). The key to this 
technique is to keep only data obtained during the (perhaps brief and intermittent) periods 
when the AO performance was at its best, tradeing depth for resolution. The enabling tech- 
nology is a fast (lOMpixs -1 giving frame rates of 20-50 Hz ) essentially noiseless read-out 
of a large format detector ( Mackav. Baldwin fc Tubbs 20031) . The role of AO is to increase 
the likelihood that the PSF is dominated by a single bright speckle at any given time, and 
hence the fraction of frames above a quality threshold (and with a wide isoplanatic angle) . 
The selected frames are then combined using a shift-and-add alignment. lLaw et al. (2009a) 
found that when using 20% of the frames, the co-added image reached the diffraction lim- 
ited of the Palomar 5-m telescope, and the FWHM and Strehl ratio were both a factor 
2 better than with AO alone. B oth shorter (< 10 ms) exposure times ( Smith et al. 200*91 ) 
and the use of Fourier selection ( Garrel et al. 201ol ) may lead to greater gains. A different 
approach is required to apply lucky imaging to pre-existing instruments, as planned for the 
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Magellan Telescope DSM, which recent tests have shown can yield significant gains in the 
visible with bright guide stars (L. Close, priv. comm.). By including a fast shutter which 
enables real-time 'frame' selection at a rate of ~ 100 Hz, and by using an additional 2 kHz 
'clean-up' tip-tilt system (as a substitute for shift-and-add ) , the aim is to fe ed an integral 
field spectrometer with > 20% Strehl ratio at 0.6-1.0 /im ( Males et al. 201oh . The attrac- 
tion of achieving the diffraction limit at visible wavelengths has also led to rapid growth in 
the number of AO lucky imaging cameras in operation at 2-4 m class telescopes, opening a 
regime in which they can compete effectively with larger telescopes. 



6 Lessons Learned and Future Outlook 

Since the first astronomical AO systems were opened to the community in the early 1990s, 
numerous technical achievements have been accomplished, a multitude of novel techniques 
have been established, and it is now inconceivable to consider building a large telescope 
without AO. Yet many of the advanced concepts still exist only on paper or are just being 
demonstrated in the laboratory. While many of these innovations emerged during the first 
half of the last decade, recent years were mostly dedicated to raising the technological 
readiness level and approaching on-sky demonstration. It is becoming more evident that 
AO is conceptually mature, and it appears that few paths remain which have not yet been 
explored. The largest progress in AO is now to be expected from technological developments 
mainly in the areas of high-density deformable mirrors and powerful real-time computers. 
Also fast and low-noise near-IR detectors have a great potential, while optical detectors with 
sub-electron read-noise and very high quantum efficiency are already close to perfection. 

The demand for AO, and particularly for LGS-AO, is high. For example, the time for 
which SINFONI is scheduled on the VLT is divided roughly equally between LGS-AO, NGS- 
AO, and seeing limited observations (ESO User Support Department). Similarly, for the last 
5 years nearly half of Keck II science nights have been allocated to AO programmes; and of 
the AO science publis hed in 2010, most extragalactic, and about half of galactic, papers had 



made use of the LGS (|Wizinowich 201 If ). In contrast, although scientific exploitation of AO 
is beginning to flourish, it struggled to get started, as reflected by the relative citation rates 
of AO papers in the main peer-reviewed journals. To s ome extent, this growth of scientific 



productivity reflects the level of investment into AO ( Froeel 20061 ). which is a concern if 



the scientific impact of AO is to continue growing. But perhaps a more important lesson 
is that there is a 'threshold' in the science domain both for case studies (does the spatial 
resolution give access to new understanding of a physical process?) and for the statistical 
approach (are there sufficient well-selected targets?). In either case, for the increasingly 
complex systems that are now being developed, it is important to bear in mind that 

there is a vast gulf between technological demonstration and scientific productivity. 

This can be seen in the expectations of early AO systems, which were raised very high 
and led to disappointment within the community and a loss of credibility. Estimates of the 
number of targets that could benefit from AO, based on statistical distributions of classes of 
objects, had been misleading in their optimism. Instead, analysis of telescope pointings (e.g. 
to assess the impact of 'LGS collisions' where scattered light from a laser at one telescope 
falls within the field of view of another) show that astronomers frequently return to the 
same targets and fields. The reason is that observational progress relies heavily on objects 
for which there is ancilliary data and prior knowledge. So far, the science impact of AO 
has benefitted greatly from case studies of relatively few intensively studied objects. In the 
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future, the use of AO on large representative samples will become increasingly important. 
Astronomers need AO to function for those targets, even though they may be demanding 
(e.g. low airmass or faint and far off-axis guide stars). Thus: 

AO ought to be accessible to targets for which the primary selection criteria are astrophysical 
rather than technical. 

One way to achieve this is to better understand the requirements imposed on the AO sys- 
tem by the targets (rather than vice-versa) using appropriate performance metrics. Strehl 
ratio directly conflicts with sky coverage, particularly in fields that have faint or sparse 
guide /tip-tilt stars; and e ncircled energy leaves the issue of spatial resolution completely 
open. 



Davies et al. f2010h suggested encircled energy should be used as a criterion in terms 



of the fractional energy within the core of the PSF, in conjunction with an independent 
specification on the maximum acceptable width of the core. Similar criteria (e.g. encircled 
energy within a pixel) are already used for a few instruments, and they will remain impor- 
tant for the ELTs since a number of the prime science drivers require high (< 100 mas) but 
not diffraction limited resolution. To fully embrace the top level scientific requirements, 

A simple set of AO performance metrics ought to be widely applicable to all types of astro- 
physical targets and science cases. 

AO performance is strongly dependent on atmospheric conditions, which has meant that 
for much of the time it is scientifically ineffective. To address this, the operating parameters 
of the current generation of AO systems can be (automatically) adjusted to match the 
ambient conditions. However, their design specifications still generally only cover the better 
part of the seeing distribution. Queue scheduling, while beneficial, can only provide part of 
the solution to dealing with different atmospheric conditions. Instead: 

AO ought to provide useful performance in moderate to poor atmospheric conditions. 

Coupled to this, especially for deep observations of distant galaxies, is the issue of long 
i ntegrations. These can exceed several hours, and 80 hr integrations are planned for MUSE 
( Bacon et al. 20091 ). To accumulate this in a reasonable time means the AO system must 
maintain its performance during long sequences of exposures despite rapid, large amplitude 
variations in the seeing on short timescales. Analysing turbulence profiles from 7 sites over 
83 nights. iRacine (2009) found that the variability is much stronger on timescales of minutes 
than between nights. And at 10 Hz timescales, seeing can change by as much as a factor of 
2 within a second (C. Mackay, priv. comm.). Following strong variability is demanding for 
AO systems, but the requirement for long integrations means that: 

AO performance ought to mitigate the effects of a highly and rapidly variable atmosphere. 

Even for well designed AO systems, functioning optimally is hard. This is partly due to 
the large number of expected and unexpected opto-mechanical and operational interfaces 
to the telescope and instrument. In the past, some of these (e.g. vibrations) have had 
a detrimental impact on AO performance. To avoid such situations, the telescope, AO 
system, and instrument need to be considered as a joint facility. System engineering of the 
entire instrument-AO-telescope system is therefore a key ingredient, and 

an AO system ought to be designed together with its telescope and instrument. 

Optimal functionality also implies a significant workload to manage and maintain a com- 
plex system at operational readiness. This is particularly so for LGS systems because of 
the additional control loops required, the beam relay, and the complexity of the lasers. 
Finally, performance depends very much on calibration and characterisation. This has a 
major impact on how well one can perform wavefront reconstruction, remove non-common 
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path errors, and compensate optical distortions. These can, for example, directly affect 
astrometric precision; but the impact of AO on astrometry, and how best to calibrate it, 
is still poorly understood ( Trippe et al. 201oh . It is clear that, as the requirements become 
more demanding and AO systems more complex, 

the operational effort to continually achieve optimal performance may be considerable. 

Extracting the best science requires PSF calibration. This is a crucial ingredient in the 
post-processing of AO data, and one that remains a major issue because it has not received 
the attention it needs. Because of the difficulties in empirically estimating the PSF, it 
is important to make use of the knowledge we have of the AO system and atmospheric 
conditions - ideally to provide a full reconstruction of the PSF, but minimally to provide 
some information about its shape and spatial variability. In this way, 

AO systems ought to provide post-processing support by enabling the PSF to be derived. 

The rate of technical progress in AO over the last 2 decades has been astounding; and 
that it can lead to important insights in astrophysical processes cannot be disputed. But if 
we wish to rely increasingly on the ability of AO to boost the performance of current and 
future telescopes, it is crucial that the astronomical and AO communities come together so 
that they can understand their respective requirements and constraints. Only this way can 
we ensure that the scientific productivity and impact of AO continue to grow. 
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